Hemocyanin nucleic acids and polypeptides and related vectors fusion protein conjugates cells and methods of use

ABSTRACT

An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH or a fragment thereof, as well as an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence complementary thereto; a vector comprising such a nucleic acid, optionally as part of an encoded fusion protein when the nucleic acid encodes KLH or a fragment thereof; a cell comprising and expressing an above-described nucleic acid, optionally in the form of a vector; an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-1, KLH-2, or a fragment of either of the foregoing, as well as a conjugate or fusion protein comprising the same and a therapeutically or prophylactically active agent, and related compositions; a method of treating cancer, pathogenic infection, and high blood pressure in a mammal; methods of stimulating an immune response in a mammal; a method of identifying an immunogenic portion of KLH, and an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding the leader sequence of KLH-1 or KLH-2, optionally coupled 5′ to an isolated or purified nucleic acid molecule encoding a polypeptide or protein, in which case the nucleic acid is optionally in the form of a vector, as well as a eukaryotic cell comprising and expressing same.

TECHNICAL FIELD OF THE INVENTION

[0001] The present invention relates to an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding keyhole limpet hemocyanin (KLH) and related nucleic acid molecules, vectors, cells, polypeptides, conjugates, fusion proteins and compositions; methods of treating cancer, pathogenic infection, and high blood pressure; methods of stimulating an immune response; and an isolated or purified nucleic acid molecule consisting essentially of the KLH leader sequence, optionally fused to an isolated or purified nucleic acid molecule encoding a polypeptide or protein, and related vectors and cells.

BACKGROUND OF THE INVENTION

[0002] KLH is a copper-containing respiratory protein in the marine gastropod Megathura crenulata. There are two isoforms of KLH, namely KLH-1 and KLH-2. KLH is clinically used as an immunotherapeutic agent, specifically in the treatment of bladder cancer (Jurincic et al., J. Urol. 139: 723-726 (1988); and Swerdlow et al., J. Urol. 151: 1718-1722 (1994)). KLH is believed to be potentially useful in the treatment of other carcinomas, such as the epithelial-derived adenocarcinomas, when used as a carrier for mucin-like epitopes and carcinoma ganglioside (Harris et al., Micron 30: 597-623 (1999)). KLH is also widely used as a hapten carrier and as a generalized vaccine component, such as in a vaccine for AIDS (Naylor et al., Int. J. Immunopharmacol. 13 Suppl. 1:117-127 (1991)) and for the prophylaxis of cocaine abuse (Bagasra et al., Immunopharmacology 23(3):173-179 (1992)). It is also used in the diagnosis of Schistosomiasis (Grzych et al., J. Exp. Med. 165: 865-878 (1987); Markl et al., Naturwissenschaften 78: 30-31 (1991); and Wishahi et al., J. Urol. 153: 926-928 (1995)) and in drug assays (Harris et al., supra). Other areas of use include the testing of immune competency (DeKruyff et al., J. Immunol. 15: 2578-2587 (1995); Helling et al., Cancer Res. 54: 197-203 (1994); and Jennemann et al., J. Biochem. (Tokyo) 115: 1047-1052 (1994)), the assessment of stress and the research of inflammatory conditions (Harris et al., supra).

[0003] KLH is currently isolated from Megathura crenulata maintained under mariculture conditions, or from naturally occurring organisms in the ocean. Maintaining and or obtaining M. crenulata under such conditions is difficult, costly and/or unreliable. In an effort to overcome these obstacles, Markl et al. (WO 00/55192 and WO 01/14536) attempted to obtain the complete coding sequence of the KLH gene but was unsuccessful. Thus, it is an object of the present invention to enable the recombinant production of KLH. This and other objects and advantages of the present invention, as well as additional inventive features, will become apparent upon reading the detailed description of the invention provided herein.

BRIEF DESCRIPTION OF THE INVENTION

[0004] The present invention provides an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-2 or a fragment thereof comprising at least 1,080 contiguous nucleotides. Similarly provided is an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence that is complementary to a nucleotide sequence encoding KLH-2 or a fragment thereof comprising at least 1,080 contiguous nucleotides.

[0005] The present invention provides an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-1 or a fragment thereof comprising at least 2,300 contiguous nucleotides. Similarly provided is an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence that is complementary to a nucleotide sequence encoding KLH-1 or a fragment thereof comprising at least 2,300 contiguous nucleotides.

[0006] In view of the above, the present invention further provides a vector comprising such an isolated or purified nucleic acid. When the isolated or purified nucleic acid molecule consists essentially of a nucleotide sequence encoding KLH or a fragment thereof, the nucleotide sequence is optionally part of an encoded fusion protein.

[0007] Also in view of the above, the present invention provides a cell comprising and expressing an above-described isolated or purified nucleic acid molecule. The isolated or purified nucleic acid molecule is optionally in the form of a vector.

[0008] Further provided by the present invention is an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-2 or at least 360 contiguous amino acids of KLH-2, either one of which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt and/or dimerized or polymerized. The isolated or purified polypeptide molecule is essentially free from KLH-1.

[0009] Further provided by the present invention is an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-1 or at least 770 contiguous amino acids of KLH-1, either one of which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt and/or dimerized or polymerized. The isolated or purified polypeptide molecule is essentially free from KLH-2.

[0010] In view of the above-described isolated or purified polypeptide molecule, the present invention further provides a conjugate or fusion protein comprising the isolated or purified polypeptide molecule or fragment thereof and a therapeutically or prophylactically active agent. The active agent can be an immunogen, such as an antigen of a pathogen or an antigen of a cancer. Alternatively, the active agent can be hemocyanin β-adrenergic receptor peptide.

[0011] Also, in view of the above-described isolated or purified polypeptide molecule, the present invention provides a composition comprising the isolated or purified polypeptide molecule, optionally in the form of a conjugate or a fusion protein comprising a therapeutically or prophylactically active agent, and an excipient or adjuvant.

[0012] Still further provided by the present invention is a method of treating cancer prophylactically or therapeutically in a mammal. The method comprises administering to the mammal an effective amount of:

[0013] (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the cancer, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0014] (b) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the cancer,

[0015] (c) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the cancer or an anti-cancer agent, or

[0016] (d) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the cancer and/or an anti-cancer agent, whereupon the mammal is treated for the cancer prophylactically or therapeutically.

[0017] Yet still further provided by the present invention is a method of treating a mammal prophylactically or therapeutically for a pathogenic infection. The method comprises administering to the mammal an effective amount of:

[0018] (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the pathogen, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0019] (b) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the pathogen,

[0020] (c) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the pathogen or an anti-pathogen agent, or

[0021] (d) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the pathogen and/or an anti-pathogen agent, whereupon the mammal is treated for the pathogenic infection prophylactically or therapeutically.

[0022] Methods of stimulating an immune response in a mammal are also provided by the present invention. In one embodiment, the method comprises administering to the mammal an effective amount of:

[0023] (a) an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-2 or an immunogenic fragment thereof comprising at least 1,080 contiguous nucleotides, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0024] (b) an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-1 or an immunogenic fragment thereof comprising at least 2,300 contiguous nucleotides, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0025] (c) an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or

[0026] (d) an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2,

[0027] whereupon an immune response in the mammal is stimulated. In another embodiment, the method comprises stimulating an immune response to an antigen in a mammal by administering to the mammal an effective amount of dendritic cells which have been previously isolated from the mammal and treated in vitro with:

[0028] (a) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen,

[0029] (b) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen, or

[0030] (c) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen,

[0031] whereupon an immune response to the antigen is induced in the mammal.

[0032] In addition, a method of treating high blood pressure in a mammal is provided by the present invention. The method comprises administering to the mammal an effective amount of:

[0033] (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) hemocyanin β-adrenergic receptor peptide, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0034] (b) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) hemocyanin β-adrenergic receptor peptide,

[0035] (c) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) hemocyanin β-adrenergic receptor peptide, or

[0036] (d) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) hemocyanin β-adrenergic receptor peptide, whereupon the mammal is treated for high blood pressure prophylactically or therapeutically.

[0037] A method of identifying an immunogenic portion of KLH is also provided. The method comprises:

[0038] (i) contacting antigen-presenting cells with KLH or a portion thereof,

[0039] (ii) harvesting the antigen-presenting cells that have been contacted with KLH or a portion thereof,

[0040] (iii) purifying complexes of major histocompatibility complex (MHC) and peptidic fragments derived from KLH from the harvested antigen-presenting cells,

[0041] (iv) isolating the peptidic fragments from the purified complexes, and

[0042] (v) analyzing the peptidic fragments to determine which fragments were bound to MHC, whereupon an immunogenic portion of KLH is identified.

[0043] An isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence encoding the leader sequence of KLH is also provided. An isolated or purified nucleic acid molecule consisting essentially of the aforementioned isolated or purified nucleic acid molecule coupled 5′ to an isolated or purified nucleic acid molecule encoding a polypeptide or protein, optionally in the form of a vector, is also provided. Upon expression in a eukaryotic cell, secretion of the encoded polypeptide or protein by the eukaryotic cell results. Thus, a eukaryotic cell comprising and expressing such an isolated or purified nucleic acid molecule is also provided.

BRIEF DESCRIPTION OF THE FIGURES

[0044]FIG. 1 represents the nucleotide (SEQ ID NO: 1 (coding); SEQ ID NO: 2 (complementary)) and deduced amino acid (SEQ ID NO: 3) sequences of KLH-2.

[0045]FIG. 2 represents the nucleotide (SEQ ID NO: 4 (coding); SEQ ID NO: 5 (complementary)) and deduced amino acid (SEQ ID NO: 6) sequences of the leader sequence of KLH-2.

[0046]FIG. 3 represents the nucleotide (SEQ ID NO: 7 (coding); SEQ ID NO: 8 (complementary)) and deduced amino acid (SEQ ID NO: 9) sequences of KLH-2 with the leader sequence.

[0047]FIG. 4 represents the nucleotide (SEQ ID NO: 10 (coding); SEQ ID NO: 11 (complementary)) and deduced amino acid (SEQ ID NO: 12) sequences of the 5′ untranslated region, the coding region, and the 3′ untranslated region of KLH-2.

[0048]FIG. 5 represents the nucleotide (SEQ ID NO: 22 (coding); SEQ ID NO: 23 (complementary)) and deduced amino acid (SEQ ID NO: 24) sequences of KLH-1.

[0049]FIG. 6 represents the nucleotide (SEQ ID NO: 25 (coding); SEQ ID NO: 26 (complementary)) and deduced amino acid (SEQ ID NO: 27) sequences of the leader sequence of KLH-1.

[0050]FIG. 7 represents the nucleotide (SEQ ID NO: 28 (coding); SEQ ID NO: 29 (complementary)) and deduced amino acid (SEQ ID NO: 30) sequences of KLH-1 with the leader sequence.

[0051]FIG. 8 represents the nucleotide (SEQ ID NO: 31 (coding); SEQ ID NO: 32 (complementary)) and deduced amino acid (SEQ ID NO: 33) sequences of the 5′ untranslated region, the coding region, and the 3′ untranslated region of KLH-1.

DETAILED DESCRIPTION OF THE INVENTION

[0052] The present invention provides an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-2 or a fragment thereof comprising at least 1,080 (or at least 1,100, 1,125, 1,150, 1,175 or 1,200 or more) contiguous nucleotides. The present invention further provides an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-1 or a fragment thereof comprising at least 2,300 (or at least 2,325, 2,350, 2,375 or 2,400 or more) contiguous nucleotides. By “isolated” is meant the removal of a nucleic acid from its natural environment. By “purified” is meant that a given nucleic acid, whether one that has been removed from nature (including genomic DNA and mRNA) or synthesized (including cDNA) and/or amplified under laboratory conditions, has been increased in purity, wherein “purity” is a relative term, not “absolute purity.” “Nucleic acid molecule” is intended to encompass a polymer of DNA or RNA, i.e., a polynucleotide, which can be single-stranded or double-stranded and which can contain non-natural or altered nucleotides. Desirably, the isolated or purified nucleic acid molecule does not contain any introns or portions thereof.

[0053] Preferably, the isolated or purified nucleic acid molecule that consists essentially of a nucleotide sequence encoding KLH-2 or a fragment thereof (i) encodes the amino acid sequence of SEQ ID NO: 3 or at least 360 (or at least 375, 400, 425 or 450 or more) contiguous amino acids of SEQ ID NO: 3, (ii) consists essentially of the nucleotide sequence of SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 (or at least 1,100, 1,125, 1,150, 1,175, 1,200 or more) contiguous nucleotides and is characterized by no more than 40% mismatch, or (iv) shares 60% (or 65%, 70%, 75, 80%, 85%, 90% or 95%) or more identity with SEQ ID NO: 1, such as naturally occurring and artificially generated variants as described herein. An isolated or purified nucleic acid molecule that hybridizes under highly stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides preferably is characterized by no more than 20% mismatch.

[0054] When the fragment is obtained from nucleotides 1-1,530 or nucleotides 9,012-10,206 of SEQ ID NO: 1 or from nucleotides 1-1,527 or nucleotides 7,975-10,197 of SEQ ID NO: 22, the fragment can be substantially smaller than 1,080 contiguous nucleotides (e.g., a fragment encoding a single epitope, such as an immunogenic epitope (e.g., an epitope that induces an immune response in a mammal)), and all fragments, irrespective of length (e.g., 1,050, 1,025, 1,000, 900, 800, 700, 600, 500, 475, 450, 425, 400, 375, 350, 325, 300, 275, 250, 225, 200, 175, 150, 125, 100, 75, 50, 25, 20, and 12-15) and unique to nucleotides 1-1,530 or nucleotides 9,012-10,206 of SEQ ID NO: 1 or nucleotides 1-1,527 or nucleotides 7,975-10,197 of SEQ ID NO: 22, are contemplated by the present invention. Another fragment that is unique to SEQ ID NO: 1 is that which spans nucleotides 7,472-7,495 of SEQ ID NO: 1, and all fragments, irrespective of length and unique to nucleotides 7,472-7,495 of SEQ ID NO: 1, are also contemplated. One of ordinary skill in the art will readily appreciate that fragments unique to KLH-2 can overlap the aforementioned regions in whole or in part and extend into adjoining regions of SEQ ID NO: 1. Likewise, fragments unique to KLH-1 can overlap the aforementioned regions in whole or in part and extend into adjoining regions of SEQ ID NO: 22.

[0055] Preferably, the isolated or purified nucleic acid molecule that consists essentially of a nucleotide sequence encoding KLH-1 or a fragment thereof (i) encodes the amino acid sequence of SEQ ID NO: 24 or at least 770 (or at least 800, 825, 850, 875 or 900 or more) contiguous amino acids of SEQ ID NO: 24, (ii) consists essentially of the nucleotide sequence of SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 (or at least 2,325, 2,350, 2,375, 2,400 or more) contiguous nucleotides, or (iv) shares 65% (or 70%, 75, 80%, 85%, 90% or 95%) or more identity with SEQ ID NO: 22, such as naturally occurring and artificially generated variants as described herein.

[0056] N-terminal sequence analysis of proteolytic cleavage products (Sohngen et al., Eur. J. Biochem. 248: 602-614 (1997)), along with analysis of amino acid sequence homology and conserved domains, enabled the determination of functional domains. Functional unit A of KLH-1 comprises amino acids 1-420 (SEQ ID NO: 24), nucleotides 1-1,260 (SEQ ID NO: 22), whereas functional unit A of KLH-2 comprises amino acids 1-422 (SEQ ID NO: 3), nucleotides 1-1,266 (SEQ ID NO: 1). Functional unit H of KLH-1 comprises amino acids 2,899-3,398 (SEQ ID NO: 24), nucleotides 8,695-10,197 (SEQ ID NO: 22), whereas functional unit H of KLH-2 comprises amino acids 2,905-3,401 (SEQ ID NO: 3), nucleotides 8,713-10,206 (SEQ ID NO: 1). Functional unit B of KLH-1 comprises amino acids 421-509 (SEQ ID NO: 24), nucleotides 1,261-1,527 (SEQ ID NO: 22), whereas functional unit B of KLH-2 comprises amino acids 423-510 (SEQ ID NO: 3), nucleotides 1,267-1,530 (SEQ ID NO: 1). Functional unit G of KLH-1 comprises amino acids 2,659-2,898 (SEQ ID NO: 24), nucleotides 7,975-8,694 (SEQ ID NO: 22).

[0057] An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding a variant KLH-2, or a fragment thereof comprising at least 1,080 (or at least 1,100, 1,125, 1,150, 1,175, or 1,200 or more) contiguous nucleotides, comprises one or more insertions, deletions, substitutions and/or inversions. Desirably, the variant KLH-2 does not differ functionally from the corresponding unmodified KLH-2, such as that comprising SEQ ID NO: 3. Preferably, the variant KLH-2 functions as an immunogen at least about 50%, more preferably at least about 75%, most preferably at least about 90% as well as the unmodified KLH-2 comprising SEQ ID NO: 3 as determined by an in vitro immunogenic assay. The manner in which the assay is carried out is not critical and can be conducted in accordance with methods known in the art. Preferably, the one or more substitution(s) results in the substitution of an amino acid of the encoded KLH-2 with another amino acid of approximately equivalent mass, structure and/or charge. A preferred variant KLH-2 comprises a deletion of all or part of the leader sequence, i.e., SEQ ID NO: 4, such that the encoded variant KLH-2 cannot be secreted by a cell.

[0058] An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding a variant KLH-1, or a fragment thereof comprising at least 2,300 (or at least 2,325, 2,350, 2,375, 2,400 or more) contiguous nucleotides, comprises one or more insertions, deletions, substitutions and/or inversions. Desirably, the variant KLH-1 does not differ functionally from the corresponding unmodified KLH-1, such as that comprising SEQ ID NO: 24. Preferably, the variant KLH-1 functions as an immunogen at least about 50%, more preferably at least about 75%, most preferably at least about 90% as well as the unmodified KLH-1 comprising SEQ ID NO: 24 as determined by an in vitro immunogenic assay. The manner in which the assay is carried out is not critical and can be conducted in accordance with methods known in the art. Preferably, the one or more substitution(s) results in the substitution of an amino acid of the encoded KLH-1 with another amino acid of approximately equivalent mass, structure and/or charge. A preferred variant KLH-1 comprises a deletion of all or part of the leader sequence, i.e., SEQ ID NO: 25, such that the encoded variant KLH-2 cannot be secreted by a cell.

[0059] The present invention also provides an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence that is complementary to a nucleotide sequence encoding KLH-2 or a fragment thereof comprising at least 1,080 (or at least 1,100, 1,125, 1,150, 1,175, or 1,200 or more) contiguous nucleotides. Such an isolated or purified nucleic acid molecule preferably (i) is complementary to a nucleotide sequence encoding the amino acid sequence of SEQ ID NO: 3 or at least 360 (or at least 375, 400, 425 or 450 or more) contiguous amino acids of SEQ ID NO: 3, (ii) is complementary to the nucleotide sequence of SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides and is characterized by no more than 40% mismatch, or (iv) shares 60% (or 65%, 70%, 75, 80%, 85%, 90% or 95%) or more identity with the nucleotide sequence that is complementary to SEQ ID NO: 1.

[0060] Likewise, the present invention provides an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence that is complementary to a nucleotide sequence encoding KLH-1 or a fragment thereof comprising at least 2,300 (or at least 2,325, 2,350, 2,375, 2,400 or more) contiguous nucleotides. Such an isolated or purified nucleic acid molecule preferably (i) is complementary to a nucleotide sequence encoding the amino acid sequence of SEQ ID NO: 24 or at least 770 (or at least 800, 825, 850, 875, 900 or more) contiguous amino acids of SEQ ID NO: 24, (ii) is complementary to the nucleotide sequence of SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides and is characterized by no more than 35% mismatch, or (iv) shares 65% (or 70%, 75, 80%, 85%, 90% or 95%) or more identity with the nucleotide sequence that is complementary to SEQ ID NO: 22. An isolated or purified nucleic acid molecule that hybridizes under highly stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides preferably is characterized by no more than 20% mismatch.

[0061] An isolated or purified nucleic acid molecule can consist essentially of a nucleotide sequence that is complementary to either of a nucleotide sequence encoding a variant KLH-2 or at least 360 (or at least 375, 400, 425 or 450 or more) contiguous amino acids of a variant KLH-2 as described above. Likewise, an isolated or purified nucleic acid molecule can consist essentially of a nucleotide sequence that is complementary to either of a nucleotide sequence encoding a variant KLH-1 or at least 770 (or at least 800, 825, 850, 875 or 900 or more) contiguous amino acids of a variant KLH-1 as described above.

[0062] With respect to the above, one of ordinary skill in the art knows how to generate insertions, deletions, substitutions and/or inversions, in a given nucleic acid molecule. See, for example, the references cited herein under “Example.” With respect to the above isolated or purified nucleic acid molecules, it is preferred that any such insertions, deletions, substitutions and/or inversions are introduced into the nucleotide sequence encoding the regions between the copper-binding domains of KLH or a variant thereof. It is also preferred that the one or more substitution(s) result(s) in the substitution of an amino acid with another amino acid of approximately equivalent size, shape and charge. Insertions, deletions, substitutions and/or inversions also can be introduced into one or more of the copper-binding domains. If such mutations are introduced into the nucleotide sequences encoding one or more of the copper-binding domains, desirably not all such domains are mutated and those copper-binding domains that are mutated are located at the N-terminal or C-terminal ends of KLH.

[0063] Also with respect to the above, “does not differ functionally from” is intended to mean that the variant KLH has activity characteristic of the unmodified KLH. In other words, it can induce an immune response in a mammal, resulting in the production of a specific antibody that can be measured by standard immunoassay techniques known in the art. However, the variant KLH can be more or less active than the unmodified KLH as desired in accordance with the present invention.

[0064] An indication that polynucleotide sequences are substantially identical is if two molecules selectively hybridize to each other under stringent conditions. The phrase “hybridizes to” refers to the selective binding of a single-stranded nucleic acid probe to a single-stranded target DNA or RNA sequence of complementary sequence when the target sequence is present in a preparation of heterogeneous DNA and/or RNA. “Stringent conditions” are sequence-dependent and will be different in different circumstances. Generally, stringent conditions are selected to be about 20° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe.

[0065] For example, under stringent conditions, as that term is understood by one skilled in the art, hybridization is preferably carried out using a standard hybridization buffer at a temperature ranging from about 50° C. to about 75° C., from about 60° C. to about 70° C., or from about 65° C. to about 68° C. Alternately, formamide can be included in the hybridization reaction, and the temperature of hybridization can be reduced to from about 35° C. to about 45° C., from about 40° C. to about 45° C., or to about 42° C. Desirably, formamide is included in the hybridization reaction at a concentration of from about 30% to about 50%, preferably from about 35% to about 45%, and optimally at about 40%. Moreover, optionally, the hybridized sequences are washed (if necessary to reduce non-specific binding) under relatively highly stringent conditions, as that term is understood by those skilled in the art. For instance, desirably, the hybridized sequences are washed one or more times using a solution comprising salt and detergent, at a temperature of from about 50° C. to about 75° C., from about 60° C. to about 70° C., or from about 65° C. to about 68° C. Preferably, a salt (e.g., such as sodium chloride) is included in the wash solution at a concentration of from about 0.01 M to about 1.0 M. Optimally, a detergent (e.g., such as sodium dodecyl sulfate) is also included at a concentration of from about 0.01% to about 1.0%. The following are examples of highly stringent and moderately stringent conditions for a Southern hybridization in aqueous buffers (no formamide) (Sambrook and Russell, Molecular Cloning, 3rd Ed. SCHL Press (2001)): Highly stringent Moderately Stringent hybridization conditions: hybridization conditions: 6X SSC or 6X SSPE 6X SSC or 6X SSPE 5x Denhardt's Reagent 5x Denhardt's Reagent 1% SDS 1% SDS 100 μg/ml salmon sperm DNA 10 μg/ml salmon sperm DNA hybridization at 65-68° C. hybridization at 58-64° C. Highly stringent Moderately stringent washing conditions: washing conditions: 0.1X SSC/0.1% SDS 2X SSC/0.1% SDS washing at 65-68° C. washing at 58-64° C.

[0066] In view of the above, “highly stringent conditions” allow for up to about 20% mismatch, preferably up to about 15% mismatch, more preferably up to about 10% mismatch, and most preferably less than about 5% mismatch, such as 4%, 3%, 2% or 1% mismatch. “At least moderately stringent conditions” preferably allow for up to about 40% mismatch, more preferably up to about 30% mismatch, and most preferably up to about 20% mismatch. “Low stringency conditions” preferably allow for up to about 60% mismatch, more preferably up to about 50% mismatch, and most preferably up to about 40% mismatch. With respect to the preceding ranges of mismatch, 1% mismatch corresponds to one degree decrease in the melting temperature.

[0067] The above isolated or purified nucleic acid molecules also can be characterized in terms of “percentage of sequence identity.” In this regard, a given nucleic acid molecule as described above can be compared to a nucleic acid molecule encoding a corresponding gene (i.e., the reference sequence) by optimally aligning the nucleic acid sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence, which does not comprise additions or deletions, for optimal alignment of the two sequences. The percentage of sequence identity is calculated by determining the number of positions at which the identical nucleic acid base occurs in both sequences, i.e., the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity. Optimal alignment of sequences for comparison may be conducted by computerized implementations of known algorithms (e.g., GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wis., or BlastN and BlastX available from the National Center for Biotechnology Information, Bethesda, Md.), or by inspection. Sequences are typically compared using BESTFIT or BlastN with default parameters.

[0068] “Substantial sequence identity” means that at least 75%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95% (such as 96%, 97%, 98% or 99%) of the sequence of a given nucleic acid molecule is identical to a given reference sequence. Typically, two polypeptides are considered to be substantially similar if at least 40%, preferably at least 60%, more preferably at least 90%, and most preferably at least 95% (such as 96%, 97%, 98% or 99%) of the amino acids of which the polypeptides are comprised are identical to or represent conservative substitutions of the amino acids of a given reference sequence.

[0069] One of ordinary skill in the art will appreciate, however, that two polynucleotide sequences can be substantially different at the nucleic acid level, yet encode substantially similar, if not identical, amino acid sequences, due to the degeneracy of the genetic code. The present invention is intended to encompass such polynucleotide sequences.

[0070] While the above-described nucleic acid molecules can be isolated or purified, alternatively they can be synthesized. Methods of nucleic acid synthesis are known in the art. See, e.g., the references cited herein under “Example.”

[0071] The above-described nucleic acid molecules can be used, in whole or in part (i.e., as fragments or primers), to identify and isolate corresponding genes from other organisms for use in the context of the present inventive methods using conventional means known in the art. See, for example, the references cited herein under “Examples.” Evolutionarily conserved portions of the sequence given in SEQ ID NO:1 or SEQ ID NO: 22, for example, can be used to identify highly related hemocyanin nucleic acids, such as in Rapana thomasiana and Megathura crenulata, among others. In addition, the above-described nucleic acid molecules (or fragments thereof) can be used as probes to identify limpet cells that produce KLH.

[0072] In view of the above, the present invention also provides a vector comprising an above-described isolated or purified nucleic acid molecule, optionally as part of an encoded fusion protein. A nucleic acid molecule as described above can be cloned into any suitable vector and can be used to transform or transfect any suitable host. The selection of vectors and methods to construct them are commonly known to persons of ordinary skill in the art and are described in general technical references (see, in general, “Recombinant DNA Part D,” Methods in Enzymology, Vol. 153, Wu and Grossman, eds., Academic Press (1987) and the references cited herein under “Example”). Desirably, the vector comprises regulatory sequences, such as transcription and translation initiation and termination codons, which are specific to the type of host (e.g., bacterium, fungus, plant or animal) into which the vector is to be introduced, as appropriate and taking into consideration whether the vector is DNA or RNA. Preferably, the vector comprises regulatory sequences that are specific to the genus of the host. Most preferably, the vector comprises regulatory sequences that are specific to the species of the host.

[0073] Constructs of vectors, which are circular or linear, can be prepared to contain an entire nucleic acid sequence as described above or a portion thereof ligated to a replication system functional in a prokaryotic or eukaryotic host cell. Replication systems can be derived from ColE1, 2 mμ plasmid, λ, SV40, bovine papilloma virus, and the like.

[0074] In addition to the replication system and the inserted nucleic acid, the construct can include one or more marker genes, which allow for selection of transformed or transfected hosts. Marker genes include biocide resistance, e.g., resistance to antibiotics, heavy metals, etc., complementation in an auxotrophic host to provide prototrophy, and the like.

[0075] Suitable vectors include those designed for propagation and expansion or for expression or both. A preferred cloning vector is selected from the group consisting of the pUC series the pBluescript series (Stratagene, LaJolla, Calif.), the pET series (Novagen, Madison, Wis.), the pGEX series (Pharmacia Biotech, Uppsala, Sweden), and the pEX series (Clontech, Palo Alto, Calif.). Bacteriophage vectors, such as λGT10, λGT11, αZapII (Stratagene), λ EMBL4, and λ NM1149, also can be used. Examples of plant expression vectors include pBI101, pBI101.2, pBI101.3, pBI121 and pBIN19 (Clontech). Examples of animal expression vectors include pEUK-C1, pMAM and pMAMneo (Clontech).

[0076] An expression vector can comprise a native or nonnative promoter operably linked to an isolated or purified nucleic acid molecule as described above. The selection of promoters, e.g., strong, weak, inducible, tissue-specific and developmental-specific, is within the skill in the art. Similarly, the combining of a nucleic acid molecule as described above with a promoter is also within the skill in the art.

[0077] Optionally, the isolated or purified nucleic acid molecule can be part of an encoded fusion protein. The generation of fusion proteins is within the ordinary skill in the art (see, e.g., references cited under “Example”) and can involve the use of restriction enzyme or recombinational cloning techniques (see, e.g., Gateway™ (Invitrogen, Carlsbad, Calif.). See, also, U.S. Pat. No. 5,314,995.

[0078] Also in view of the above, the present invention provides a host cell comprising and expressing an isolated or purified nucleic acid molecule, optionally in the form of a vector, as described above. Examples of host cells include, but are not limited to, a human cell, a human cell line, E. coli (e.g., E. coli TB-1, TG-2, DH5α, XL-Blue MRF' (Stratagene), SA2821 and Y1090), B. subtilis, P. aerugenosa, S. cerevisiae, N. crassa, insect cells (e.g., Sf9, Ea4) and others set forth herein below. Depending on the construct, it may be necessary to keep a culture of E. coli at low density for cloning purposes and, possibly, also for expression.

[0079] The present invention further provides an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-2 or at least 360 (or at least 375, 400, 425 or 450 or more) contiguous amino acids of KLH-2, which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt and/or dimerized or polymerized, wherein the isolated or purified polypeptide molecule is essentially free from KLH-1. Also provided is an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-1 or at least 770 (or at least 800, 825, 850, 875 or 900 or more) contiguous amino acids of KLH-1, which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt and/or dimerized or polymerized, wherein the isolated or purified polypeptide molecule is essentially free from KLH-2. By “polymerized” is meant at least trimerized. Especially preferred polymers include decamers and didecamers. Methods of protein modification (e.g., glycosylation, amidation, carboxylation, phosphorylation, esterification, N-acylation, and conversion into acid addition salts), dimerization and polymerization are known in the art.

[0080] When the fragment is obtained from amino acids 1-510 or amino acids 3,004-3,401 of SEQ ID NO: 3, the fragment can be substantially smaller than 360 contiguous amino acids (e.g., a fragment comprising a single epitope, such as an immunogenic epitope (e.g., an epitope that induces an immune response in a mammal)), and all fragments, irrespective of length (e.g., 350, 325, 300, 275, 250, 225, 200, 175, 150, 125, 100, 75, 50, 25, 20, 15, 10, and 4-5) and unique to amino acids 1-510 or amino acids 3,004-3,401 of SEQ ID NO: 3, are contemplated by the present invention. Other fragments that are unique to SEQ ID NO: 3 are those that span amino acids 2,491-2,498 and 2,892-2,907 of SEQ ID NO: 3. One of ordinary skill in the art will readily appreciate that fragments unique to KLH-2 can overlap the aforementioned regions in whole or in part and extend into adjoining regions of SEQ ID NO: 3.

[0081] When the fragment is obtained from amino acids 1-509 or amino acids 2,659-3,398 of SEQ ID NO: 24, the fragment can be substantially smaller than 770 continguous amino acids (e.g., a fragment comprising a single epitope, such as an immunogenic epitope (e.g., an epitope that induces an immune response in a mammal)), and all fragments, irrespective of length (e.g., 750, 725, 700, 675, 650, 625, 600, 575, 550, 525, 500, 475, 450, 425, 400, 375, 350, 325, 300, 275, 250, 225, 200, 175, 150, 125, 100, 75, 50, 25, 20, 15, 10, and 4-5) and unique to amino acids 1-509 or amino acids 2,659-3,398 of SEQ ID NO: 24 are contemplated by the present invention. One of ordinary skill in the art will readily appreciate that fragments unique to KLH-1 can overlap the aforementioned regions in whole or in part and extend into adjoining regions of SEQ ID NO: 24.

[0082] One of ordinary skill in the art will readily appreciate that one or more fragments of KLH-2, such as from amino acids 1-510 or amino acids 3,004-3,401 of SEQ ID NO: 3, can be combined with one or more fragments of KLH-1, such as from amino acids 1-509 or amino acids 2,659-3,398 of SEQ ID NO: 24. Such combinations of fragments can be generated at the nucleic acid or amino acid level in accordance with methods known in the art.

[0083] An isolated or purified polypeptide molecule can consist essentially of an amino acid sequence encoding a variant KLH-2 or at least 360 (or at least 375, 400, 425 or 450 or more) contiguous amino acids of a variant KLH-2, which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated, converted into an acid addition salt and/or dimerized or polymerized, wherein the isolated or purified polypeptide molecule is essentially free from KLH-1. Alternatively, an isolated or purified polypeptide molecule can consist essentially of an amino acid sequence encoding a variant KLH-1 or at least 770 (or at least 800, 825, 850, 875 or 900 or more) contiguous amino acids of a variant KLH-1, which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated, converted into an acid addition salt and/or dimerized or polymerized, wherein the isolated or purified polypeptide molecule is essentially free from KLH-2.

[0084] In view of the above isolated or purified polypeptide molecules and variants thereof, the present invention also provides fragments thereof comprising at least one epitope that can bind to a receptor on the surface of a cell that functions in cellular immunity, such as a T cell or a B cell. An example of such a receptor is the T cell receptor. Such fragments can consist essentially of from about 5 to about 8 amino acids, from about 10 to about 16 amino acids, from about 20 to about 30 amino acids, or can comprise substantially more amino acids. The fragments can be readily generated by the ordinarily skilled artisan in accordance with methods known in the art. The ability of such fragments to bind to a receptor on the surface of a cell that functions in cellular immunity also can be determined in accordance with methods known in the art. The fragments can comprise from about 5 to about 8 amino acids or more in the following regions of SEQ ID NO: 3: amino acids 1-510, 539-543, 823-827, 1,655-1,660, 2,491-2,498, 2,892-2,907, and 3,004-3,401 or in the following regions of SEQ ID NO: 24: amino acids 1-509 and 2,659-3,398. These regions are exemplary and are not limiting.

[0085] Functional units of KLH can be determined by N-terminal sequence analysis of proteolytic cleavage products. See, e.g., Harris et al., Micron 30: 597-623 (1999). Reverse immunogenetics can be used to determine peptides presented by the major histocompatibility complex (MHC) class I and II molecules to T-cell receptors upon stimulation with KLH in order to identify the most potent peptides and domains of KLH for antigen presentation. See, e.g., Immunobiology, 5th ed., Janeway et al., eds., Garland Publishing, New York and London (2001).

[0086] In view of the foregoing, the present invention provides a method of identifying an immunogenic portion of KLH. The method comprises:

[0087] (i) contacting antigen-presenting cells with KLH or a portion thereof,

[0088] (ii) harvesting the antigen-presenting cells that have been contacted with KLH or a portion thereof,

[0089] (iii) purifying complexes of major histocompatibility complex (MHC) and peptidic fragments derived from KLH from the harvested antigen-presenting cells,

[0090] (iv) isolating the peptidic fragments from the purified complexes, and

[0091] (v) analyzing the peptidic fragments to determine which fragments were bound to MHC, whereupon an immunogenic portion of KLH is identified. The method can be conducted in vitro or in vivo. If conducted in vivo, a preferred organism is a mammal. The complexes can be purified by any suitable purification method know in the art, such as immuno-affinity purification. The peptidic fragments can be isolated from the purified complexes by any suitable means, such as acetic acid elution. Preferably, the KLH is KLH-1 in the absence of KLH-2 or KLH-2 in the absence of KLH-1. The peptides can be analyzed in accordance with methods well-known in the art, such as by post-source decay and matrix-assisted laser desorption ionization time-of-flight mass spectrometry (PSD-MALDI-TOF-MS) (see, e.g., J. Immunol. Methods 246: 1-12 (2000)). Accordingly, the present invention provides an immunogenic portion of KLH identified in accordance with the method.

[0092] The polypeptide preferably comprises an amino end and a carboxyl end. The polypeptide can comprise D-amino acids, L-amino acids or a mixture of D- and L-amino acids. The D-form of the amino acids, however, is particularly preferred since a polypeptide comprised of D-amino acids is expected to have a greater retention of its biological activity in vivo, given that the D-amino acids are not recognized by naturally occurring proteases.

[0093] The polypeptide can be prepared by any of a number of conventional techniques. The polypeptide can be isolated or purified from a naturally occurring source or from a recombinant source. While the polypeptide can be isolated or purified from a naturally occurring source, such isolation is difficult and costly and does not readily or easily enable the separation of KLH polypeptides of isotypes 1 and 2. Thus, recombinant production is preferred. For instance, in the case of recombinant polypeptides, a DNA fragment encoding a desired peptide can be subcloned into an appropriate vector using well-known molecular genetic techniques (see, e.g., Maniatis et al., Molecular Cloning: A Laboratory Manual, 2nd ed. (Cold Spring Harbor Laboratory, 1982); Sambrook et al., Molecular Cloning: A Laboratory Manual, 2^(nd) ed. (Cold Spring Harbor Laboratory, 1989). The fragment can be transcribed and the polypeptide subsequently translated in vitro. Commercially available kits can also be employed (e.g., such as manufactured by Clontech, Palo Alto, Calif.; Amersham Pharmacia Biotech Inc., Piscataway, N.J.; InVitrogen, Carlsbad, Calif., and the like). The polymerase chain reaction optionally can be employed in the manipulation of nucleic acids.

[0094] Alterations of the native amino acid sequence to produce variant polypeptides can be done by a variety of means known to those skilled in the art. For instance, site-specific mutations can be introduced by ligating into an expression vector a synthesized oligonucleotide comprising the modified site. Alternately, oligonucleotide-directed site-specific mutagenesis procedures can be used such as disclosed in Walder et al., Gene 42: 133 (1986); Bauer et al., Gene 37: 73 (1985); Craik, Biotechniques, 12-19 (January 1995); and U.S. Pat. Nos. 4,518,584 and 4,737,462.

[0095] With respect to the above isolated or purified polypeptides, it is preferred that any such insertions, deletions and/or substitutions are introduced between copper-binding domains of the KLH or a variant thereof. It is also preferred that the one or more substitution(s) result(s) in the substitution of an amino acid with another amino acid of approximately equivalent mass, structure and charge. Insertions, deletions and/or substitutions also can be introduced into one or more of the copper-binding domains. If such mutations are introduced into one or more of the copper-binding domains, desirably not all copper-binding domains are mutated and domains that are mutated are located at the N-terminal or C-terminal ends of KLH.

[0096] Any appropriate expression vector (e.g., as described in Pouwels et al., Cloning Vectors: A Laboratory Manual (Elsevier, NY: 1985)) and corresponding suitable host can be employed for production of recombinant polypeptides. Expression hosts include, but are not limited to, bacterial species within the genera Escherichia, Bacillus, Pseudomonas, Salmonella, mammalian or insect host cell systems including baculovirus systems (e.g., as described by Luckow et al., Bio/Technology 6: 47 (1988)), and established cell lines such as the COS-7, C127, 3T3, CHO, HeLa, BHK cell line, and the like. The ordinarily skilled artisan is, of course, aware that the choice of expression host has ramifications for the type of polypeptide produced. For instance the glycosylation of polypeptides produced in yeast or mammalian cells (e.g., COS-7 cells) will differ from that of polypeptides produced in bacterial cells such as Escherichia coli.

[0097] Alternately, the polypeptide (including the variant polypeptides) can be synthesized using standard peptide synthesizing techniques well-known to those of skill in the art (e.g., as summarized in Bodanszky, Principles of Peptide Synthesis, (Springer-Verlag, Heidelberg: 1984)). In particular, the polypeptide can be synthesized using the procedure of solid-phase synthesis (see, e.g., Merrifield, J. Am. Chem. Soc. 85: 2149-54 (1963); Barany et al., Int. J. Peptide Protein Res. 30: 705-739 (1987); and U.S. Pat. No. 5,424,398). If desired, this can be done using an automated peptide synthesizer. Removal of the t-butyloxycarbonyl (t-BOC) or 9-fluorenylmethyloxycarbonyl (Fmoc) amino acid blocking groups and separation of the polypeptide from the resin can be accomplished by, for example, acid treatment at reduced temperature. The polypeptide-containing mixture can then be extracted, for instance, with dimethyl ether, to remove non-peptidic organic compounds, and the synthesized polypeptide can be extracted from the resin powder (e.g., with about 25% w/v acetic acid). Following the synthesis of the polypeptide, further purification (e.g., using high performance liquid chromatography (HPLC)) optionally can be done in order to eliminate any incomplete polypeptides or free amino acids. Amino acid and/or HPLC analysis can be performed on the synthesized polypeptide to validate its identity. For other applications according to the invention, it may be preferable to produce the polypeptide as part of a larger fusion protein, such as by the methods described herein or other genetic means, or as part of a larger conjugate, such as through physical or chemical conjugation, as known to those of ordinary skill in the art and described herein.

[0098] If desired, the polypeptides of the invention (including variant polypeptides) can be modified, for instance, by glycosylation, amidation, carboxylation, or phosphorylation, or by the creation of acid addition salts, amides, esters, in particular C-terminal esters, and N-acyl derivatives of the polypeptides of the invention. The polypeptides also can be modified to create polypeptide derivatives by forming covalent or noncovalent complexes with other moieties in accordance with methods known in the art. Covalently-bound complexes can be prepared by linking the chemical moieties to functional groups on the side chains of amino acids comprising the polypeptides, or at the N- or C-terminus.

[0099] Thus, in this regard, the present invention also provides a fusion protein and a conjugate comprising an above-described isolated or purified polypeptide molecule or fragment thereof and a therapeutically or prophylactically active agent. “Prophylactically” as used herein does not necessarily mean prevention, although prevention is encompassed by the term. Prophylactic activity also can include lesser effects, such as inhibition of the spread of cancer or a pathogenic infection; etc. Preferably, the active agent is a hapten, an allergen, a sperm coat protein (e.g., for use as a contraceptive), an immunogen, or hemocyanin β-adrenergic receptor peptide. Preferably, the immunogen is an antigen of a pathogen, e.g., bacterial, viral or parasitic pathogen, or an antigen of a cancer. Methods of conjugation are known in the art. In addition, conjugate kits are commercially available. For examples of methods of conjugation and conjugates see, e.g., Hermanson, G. T., Bioconjugate Techniques, 1996, Academic Press, San Diego, Calif.; U.S. Pat. Nos. 6,013,779; 6,274,552 and 6,080,725 and Ragupathi et al., Glycoconjugate Journal 15: 217-221 (1998).

[0100] The present invention also provides a composition comprising an above-described isolated or purified polypeptide molecule, optionally in the form of a conjugate or a fusion protein comprising a prophylactically or therapeutically active agent, and an excipient or an adjuvant. Excipients and adjuvants are well-known in the art, and are readily available. The choice of excipient/adjuvant will be determined in part by the particular route of administration and whether a nucleic acid molecule or a polypeptide molecule (or conjugate or fusion protein thereof) is being administered. Accordingly, there is a wide variety of suitable formulations for use in the context of the present invention, and the invention expressly provides a pharmaceutical composition that comprises an active agent of the invention and a pharmaceutically acceptable excipient/adjuvant. The following methods and excipients/adjuvants are merely exemplary and are in no way limiting.

[0101] Formulations suitable for oral administration can consist of (a) liquid solutions, such as an effective amount of the compound dissolved in diluent, such as water, saline, or orange juice; (b) capsules, sachets or tablets, each containing a predetermined amount of the active ingredient, as solids or granules; (c) suspensions in an appropriate liquid; and (d) suitable emulsions. Tablet forms can include one or more of lactose, mannitol, corn starch, potato starch, microcrystalline cellulose, acacia, gelatin, colloidal silicon dioxide, croscarmellose sodium, talc, magnesium stearate, stearic acid, and other excipients, colorants, diluents, buffering agents, moistening agents, preservatives, flavoring agents, and pharmacologically compatible excipients. Lozenge forms can comprise the active ingredient in a flavor, usually sucrose and acacia or tragacanth. Pastilles can comprise the active ingredient in an inert base, such as gelatin and glycerin, or sucrose and acacia, emulsions, gels, and the like containing, in addition to the active ingredient, such excipients/carriers as are known in the art.

[0102] An active agent of the present invention, either alone or in combination with other suitable components, can be made into aerosol formulations to be administered via inhalation. These aerosol formulations can be placed into pressurized acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like. They also can be formulated as pharmaceuticals for non-pressured preparations such as in a nebulizer or an atomizer.

[0103] Formulations suitable for parenteral administration include aqueous and non-aqueous, isotonic sterile injection solutions, which can contain anti-oxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with the blood of the intended recipient, and aqueous and non-aqueous sterile suspensions that can include suspending agents, solubilizers, thickening agents, stabilizers, and preservatives. The formulations can be presented in unit-dose or multi-dose sealed containers, such as ampules and vials, and can be stored in a freeze-dried (lyophilized) condition requiring only the addition of the sterile liquid excipient, for example, water, for injections, immediately prior to use. Extemporaneous injection solutions and suspensions can be prepared from sterile powders, granules, and tablets of the kind previously described.

[0104] Additionally, active agents of the present invention can be made into suppositories by mixing with a variety of bases such as emulsifying bases or water-soluble bases. Formulations suitable for vaginal administration can be presented as pessaries, tampons, creams, gels, pastes, foams, or spray formulas containing, in addition to the active ingredient, such carriers as are known in the art to be appropriate. Further suitable formulations are found in Remington's Pharmaceutical Sciences, 17th ed., (Mack Publishing Company, Philadelphia, Pa.: 1985), and methods of drug delivery are reviewed in, for example, Langer, Science 249: 1527-1533 (1990).

[0105] In view of the above, the present invention provides a method of treating cancer prophylactically or therapeutically in a mammal. The method comprises administering to the mammal an effective amount of:

[0106] (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH, a variant KLH or an immunogenic fragment of either of the foregoing, and (ii) an antigen of the cancer, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0107] (b) a fusion protein comprising (i) KLH, a variant KLH or an immunogenic fragment of either of the foregoing, and (ii) an antigen of the cancer,

[0108] (c) a conjugate comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-1 or an immunogenic fragment of either of the foregoing comprising at least 770 amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the cancer or an anti-cancer agent, or

[0109] (d) a composition comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-1, or an immunogenic fragment of either of the foregoing comprising at least 770 amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the cancer and/or an anti-cancer agent, whereupon the mammal is treated for the cancer prophylactically or therapeutically. Such a method can further comprise the co-administration, whether simultaneously or sequentially, in either order, of an anti-cancer agent by the same or different route.

[0110] Preferably, the cancer is adenocarcinoma, bladder cancer, colon cancer, breast cancer, lung cancer or skin cancer. The skin cancer is preferably melanoma.

[0111] The anti-cancer agent can be a chemotherapeutic agent, e.g., a polyamine or an analogue thereof. Examples of therapeutic polyamines include those set forth in U.S. Pat. Nos. 5,880,161, 5,541,230 and 5,962,533, Saab et al., J. Med. Chem. 36: 2998-3004 (1993), Bergeron et al., J. Med. Chem. 37(21): 3464-3476 (1994), Casero et al., Cancer Chemother. Pharmacol 36: 69-74 (1995), Bernacki et al., Clin. Cancer Res. 1: 847-857 (1995); Bergeron et al., J. Med. Chem. 40: 1475-1494 (1997); Gabrielson et al., Clinical Cancer Res. 5: 1638-1641 (1999), and Bergeron et al., J. Med. Chem. 43: 224-235 (2000), which can be administered alone or in combination with other active agents, such as anti-cancer agents, e.g., cis-diaminedichloroplatinum (II) and 1,3-bis(2-chloroethyl)-1-nitrosourea.

[0112] Preferred routes of administration in the first embodiment of the method of treating cancer include intratumoral and peritumoral routes of administration. A preferred manner of administering a separate anti-cancer agent is by targeting to a cancer cell. In this regard, examples of cancer-specific, cell-surface molecules include placental alkaline phosphatase (testicular and ovarian cancer), pan carcinoma (small cell lung cancer), polymorphic epithelial mucin (ovarian cancer), prostate-specific membrane antigen, α-fetoprotein, B-lymphocyte surface antigen (B-cell lymphoma), truncated EGFR (gliomas), idiotypes (B-cell lymphoma), gp95/gp97 (melanoma), N-CAM (small cell lung carcinoma), cluster w4 (small cell lung carcinoma), cluster 5A (small cell carcinoma), cluster 6 (small cell lung carcinoma), PLAP (seminomas, ovarian cancer, and non-small cell lung cancer), CA-125 (lung and ovarian cancers), ESA (carcinoma), CD19, 22 or 37 (B-cell lymphoma), 250 kD proteoglycan (melanoma), P55 (breast cancer), TCR-IgH fusion (childhood T-cell leukemia), blood group A antigen in B or O type individual (gastric and colon tumors), and the like. See, e.g., U.S. Pat. No. 6,080,725 for other examples.

[0113] Examples of cancer-specific, cell-surface receptors include erbB-2, erbB-3, erbB-4, IL-2 (lymphoma and leukemia), IL-4 (lymphoma and leukemia), IL-6 (lymphoma and leukemia), MSH (melanoma), transferrin (gliomas), tumor vasculature integrins, and the like. Preferred cancer-specific, cell-surface receptors include erbB-2 and tumor vasculature integrins, such as CD11a, CD11b, CD11c, CD18, CD29, CD51, CD61, CD66d, CD66e, CD106, and CDw145.

[0114] There are a number of antibodies to cancer-specific, cell-surface molecules and receptors that are known. C46 Ab (Amersham) and 85A12 Ab (Unipath) to carcino-embryonic antigen, H17E2 Ab (ICRF) to placental alkaline phosphatase, NR-LU-10 Ab (NeoRx Corp.) to pan carcinoma, HMFC1 Ab (ICRF) to polymorphic epithelial mucin, W14 Ab to B-human chorionic gonadotropin, RFB4 Ab (Royal Free Hospital) to B-lymphocyte surface antigen, A33 Ab (Genex) to human colon carcinoma, TA-99 Ab (Genex) to human melanoma, antibodies to c-erbB2 (JP 7309780, JP 8176200 and JP 7059588), and the like. ScAbs can be developed, based on such antibodies, using techniques known in the art (see for example, Bind et al., Science 242: 423-426 (1988), and Whitlow et al., Methods 2(2): 97-105 (1991)).

[0115] Also in view of the above, the present invention provides a method of treating a mammal prophylactically or therapeutically for a pathogenic infection. The method comprises administering to the mammal an effective amount of:

[0116] (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH, a variant KLH or an immunogenic fragment of either of the foregoing, and (ii) an antigen of the pathogen, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0117] (b) a fusion protein comprising (i) KLH, a variant KLH or an immunogenic fragment of either of the foregoing, and (ii) an antigen of the pathogen,

[0118] (c) a conjugate comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the pathogen or an anti-pathogen agent, or

[0119] (d) a composition comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-1, or an immunogenic fragment of either of the foregoing comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the pathogen or an anti-pathogen agent, whereupon the mammal is treated for the viral infection prophylactically or therapeutically. The pathogen can be any pathogen, such as a virus, a bacterium or a fungus. For examples of bacterial and viral antigens, see, e.g., U.S. Pat. No. 6,080,725. Fungal antigens are known in the art. Anti-pathogen agents, i.e., active agents used to treat prophylactically or therapeutically infection of an animal, such as a mammal, in particular a human, with a virus, bacterium or fungus, are known in the art.

[0120] A method of stimulating an immune response (i.e., immune reactants, e.g., antibodies or specific effector T-cells, directed at an antigen) in a mammal is also provided. In one embodiment, the method comprises administering to the mammal an effective amount of:

[0121] (a) an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-2, a variant KLH-2 or an immunogenic fragment of either of the foregoing comprising at least 1,080 contiguous nucleotides, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0122] (b) an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-1, a variant KLH-1 or an immunogenic fragment of either of the foregoing comprising at least 2,300 contiguous nucleotides, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0123] (c) an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-2, a variant KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or

[0124] (d) an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-1, a variant KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2,

[0125] whereupon an immune response in the mammal is stimulated. Preferably, the stimulation of the immune response results in the inhibition of tyrosinase. In another embodiment, the method comprises stimulating an immune response to an antigen in a mammal by administering to the mammal an effective amount of dendritic cells which have been previously isolated from the mammal and treated in vitro with:

[0126] (a) a fusion protein comprising (i) KLH, a variant KLH or an immunogenic fragment of either of the foregoing, and (ii) an antigen,

[0127] (b) a conjugate comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-1, or an immunogenic fragment of either of the foregoing comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen, or

[0128] (c) a composition comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-1, or an immunogenic fragment of either of the foregoing comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen,

[0129] whereupon an immune response to the antigen is induced in the mammal.

[0130] Preferably, the antigen is an antigen of a pathogen or an antigen of a cancer as herein described. The method can further comprise the simultaneous or sequential co-administration of an active agent (e.g., an anti-cancer agent or an anti-pathogen agent) by the same or different route. Methods of isolating, treating ex vivo, and administering dendritic cells are known in the art. See, e.g., International Patent Application WO 00/73432.

[0131] Further provided is a method of treating high blood pressure in a mammal. The method comprises administering to the mammal an effective amount of:

[0132] (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH, a variant KLH or an immunogenic fragment of either of the foregoing, and (ii) hemocyanin β-adrenergic receptor peptide, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector,

[0133] (b) a fusion protein comprising (i) KLH, a variant KLH or an immunogenic fragment of either of the foregoing, and (ii) hemocyanin β-adrenergic receptor peptide,

[0134] (c) a conjugate comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-1, or an immunogenic fragment of either of the foregoing comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) hemocyanin β-adrenergic receptor peptide, or

[0135] (d) a composition comprising (i) (a) KLH-2, a variant KLH-2, or an immunogenic fragment of either of the foregoing comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1, a variant KLH-1, or an immunogenic fragment of either of the foregoing comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) hemocyanin β-adrenergic receptor peptide, whereupon the mammal is treated for high blood pressure prophylactically or therapeutically.

[0136] Generally, when KLH (or a conjugate or fusion protein thereof) is administered to an animal, such as a mammal, in particular a human, it is preferable that KLH be administered in a dose of from about 1 to about 1,000 μg/kg body weight/treatment when given parenterally. However, this dosage range is merely preferred, and higher or lower doses may be chosen in appropriate circumstances. For instance, the actual dose and schedule can vary depending on whether the composition is administered in combination with other pharmaceutical compositions, or depending on interindividual differences in pharmacokinetics, drug disposition, and metabolism. One skilled in the art easily can make any necessary adjustments in accordance with the necessities of the particular situation.

[0137] Those of ordinary skill in the art can easily make a determination of the amount of an above-described isolated and purified nucleic acid molecule to be administered to an animal, such as a mammal, in particular a human. The dosage will depend upon the particular method of administration, including any vector or promoter utilized. For purposes of considering the dose in terms of particle units (pu), also referred to as viral particles, it can be assumed that there are 100 particles/pfu (e.g., 1×10¹² pfu is equivalent to 1×10¹⁴ pu). An amount of recombinant virus, recombinant DNA vector or RNA genome sufficient to achieve a tissue concentration of about 10² to about 10¹² particles per ml is preferred, especially of about 10⁶ to about 10¹⁰ particles per ml. In certain applications, multiple daily doses are preferred. Moreover, the number of doses will vary depending on the means of delivery and the particular recombinant virus, recombinant DNA vector or RNA genome administered.

[0138] A targeting moiety also can be used in the contact of a cell with an above-described isolated or purified nucleic acid molecule. In this regard, any molecule that can be linked with the therapeutic nucleic acid directly or indirectly, such as through a suitable delivery vehicle, such that the targeting moiety binds to a cell-surface receptor, can be used. The targeting moiety can bind to a cell through a receptor, a substrate, an antigenic determinant or another binding site on the surface of the cell. Examples of a targeting moiety include an antibody (i.e., a polyclonal or a monoclonal antibody), an immunologically reactive fragment of an antibody, an engineered immunoprotein and the like, a protein (target is receptor, as substrate, or regulatory site on DNA or RNA), a polypeptide (target is receptor), a peptide (target is receptor), a nucleic acid, which is DNA or RNA (i.e., single-stranded or double-stranded, synthetic or isolated and purified from nature; target is complementary nucleic acid), a steroid (target is steroid receptor), and the like. Analogs of targeting moieties that retain the ability to bind to a defined target also can be used. In addition, synthetic targeting moieties can be designed, such as to fit a particular epitope. Alternatively, the therapeutic nucleic acid can be encapsulated in a liposome comprising on its surface the targeting moiety.

[0139] The targeting moiety includes any linking group that can be used to join a targeting moiety to, in the context of the present invention, an above-described nucleic acid molecule. It will be evident to one skilled in the art that a variety of linking groups, including bifunctional reagents, can be used. The targeting moiety can be linked to the therapeutic nucleic acid by covalent or non-covalent bonding. If bonding is non-covalent, the conjugation can be through hydrogen bonding, ionic bonding, hydrophobic or van der Waals interactions, or any other appropriate type of binding.

[0140] An isolated or purified nucleic acid molecule consisting essentially of SEQ ID NO: 4 or SEQ ID NO: 25 or encoding SEQ ID NO: 6 or SEQ ID NO: 27 is also provided. Additionally provided is an isolated or purified nucleic acid molecule consisting essentially of the aforementioned nucleic acid molecule coupled 5′ to an isolated or purified nucleic acid molecule encoding a polypeptide or protein, optionally in the form of a vector, which, upon expression in a eukaryotic cell, results in the secretion of the encoded polypeptide or protein by the eukaryotic cell. See, e.g., U.S. Pat. No. 6,169,172. Thus, a eukaryotic cell comprising and expressing such a nucleic acid molecule is also provided.

EXAMPLES

[0141] The following examples serve to illustrate the present invention and is not intended to limit its scope in any way.

[0142] The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference:

[0143] Birren et al., Genome Analysis: A Laboratory Manual Series, Volume 1, Analyzing DNA, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1997),

[0144] Birren et al., Genome Analysis: A Laboratory Manual Series, Volume 2, Detecting Genes, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1998),

[0145] Birren et al., Genome Analysis: A Laboratory Manual Series, Volume 3, Cloning Systems, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1999),

[0146] Birren et al., Genome Analysis: A Laboratory Manual Series, Volume 4, Mapping Genomes, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1999),

[0147] Harlow et al., Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1988),

[0148] Harlow et al., Using Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1999),

[0149] Hoffman, Cancer and the Search for Selective Biochemical Inhibitors, CRC Press (1999),

[0150] Pratt, The Anticancer Drugs, 2nd edition, Oxford University Press, NY (1994), and

[0151] Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989).

Example 1

[0152] This example describes the sequencing of the KLH-2 cDNA.

[0153] Fresh limpet tissue associated with the respiratory system and other organ tissues were dissected and immediately frozen on dry ice. The tissue was stored at −70° C.

[0154] Two hundred mg of tissue were homogenized with a PowerGen 125 (Fisher Scientific, Pittsburgh, Pa.) in 3.0 ml Trizol Reagent (Life Technologies, Gaithersburg, Md.). The sample was centrifuged at 15,000×g at 4° C. for 10 min. The sample was allowed to come to room temperature (15-30° C.) for two min. The aqueous layer was extracted twice with 0.3 ml of chloroform. The RNA was then precipitated with isopropanol. The RNA pellets were washed with 75% ethanol, dried and dissolved in distilled water. The RNA solution was stored at −70° C.

[0155] cDNA was prepared using the Thermoscript™ RT-PCR system (Life Technologies, Gaithersburg, Md.) following the manufacturer's recommended directions. The cDNA was primed with oligo dT.

[0156] Degenerate PCR primers were designed using the amino acid sequence of the KLH functional unit 2c (Sohngen et al., Eur. J. Biochem. 248: 602-614 (1997); and Stoeva et al., Biochim. Biophys. Acta 1435: 94-109 (1999)).

[0157] The design of the degenerate PCR primers took into consideration codon usage frequency. The Codon Usage Database is an extended WWW version of CUTG (Codon Usage Tabulated from GenBank) and was used to determine codon usage.

[0158] Codon frequency tables were also generated using the Countcodon program (version 4). Octopus dolfleini was used as a model hemocyanin and the codon usage correlated well with other standard codon usage tables. In addition, the degenerate PCR primers were designed to avoid, when possible, amino acids with numerous codons, e.g., leucine, arginine and serine.

[0159] IUB group codes were used to identify redundancies.

[0160] R=A+G

[0161] Y=C+T

[0162] M=A+C

[0163] K=G+T

[0164] S=G+C

[0165] W=A+T

[0166] H=A+T+C

[0167] B=G+T+C

[0168] D=G+A+T

[0169] N=A+G+T+C

[0170] V=G+A+C Primer 12 k1h2cSns-lessdeg71 TTYCCWCAYTGGCAY [SEQ ID NO: 13] Primer 11 k1h2cSns-deg71 TTYCCNCAYTGGCAY [SEQ ID NO: 14] Primer 30 k1h2cAnti-lessdeg223 TTGCCAAATDGGCCA [SEQ ID NO: 15] Primer 15 k1h2cSns-deg96 CNTAYTGGGAYTGGAC [SEQ ID NO: 16] Primer 53 K1h2cAnti-deg354 GGGATYTCNGCYTCRTC [SEQ ID NO: 17]

[0171] PCR reactions were done with the following conditions:

[0172] 94° C., 15 sec

[0173] 52° C., 30 sec

[0174] 72° C., 1 min

[0175] 35 cycles

[0176] 72° C., 10 min

[0177] 4° C. Hold

[0178] Primer 12+Primer 53 yielded PCR product of approximately 850 base pairs. Plasmid 3.1

[0179] Primer 11+Primer 30 yielded PCR product of approximately 460 base pairs. Plasmid 6.2

[0180] Primer 11+Primer 30 yielded PCR product of approximately 460 base pairs. Plasmid 6.1

[0181] Primer 12+Primer 30 yielded PCR product of approximately 460 base pairs. Plasmid 8.1

[0182] Primer 12+Primer 30 yielded PCR product of approximately 460 base pairs. Plasmid 8.2

[0183] Primer 15+Primer 30 yielded PCR product of approximately 380 base pairs. Plasmid 10.2

[0184] PCR products were separated on a 1% agarose gel. Bands of the expected size were excised with a razor blade.

[0185] The amplified DNA was extracted from the agarose using an Ultrafree-DA (Millipore, Bedford, Mass.) spin column. The gel-purified fragments were then cloned into pGEM-T Easy vector (Promega, Madison, Wis.) according to the manufacturer's recommended directions.

[0186] Cells were transformed with pGEM-T easy vector containing PCR product. Max efficiency DH5 alpha chemically competent cells (Life Technologies) were used according to the manufacturer's recommended directions.

[0187] After transformation, colonies were picked and cultured in LB broth containing 100 μg/ml of ampicillin. Plasmids were purified using the QIAprep Spin Miniprep Kit (Qiagen, Valencia, Calif.) according to the manufacturer's directions.

[0188] Plasmids were screened for inserts by digesting with Not I and subsequent gel electrophoresis. Plasmids containing inserts of the expected size were sequenced.

[0189] Sequence data were translated and clones of the open reading frames were compared with other hemocyanins using the BLAST sequence analysis program. Clones containing homology to hemocyanins found in the database were used to make primers for 5′ and 3′ RACE reactions.

[0190] 5′ and 3′ RACE reactions were performed using the SMART™ RACE cDNA Amplification Kit (Catalog K 1811-1, Clontech) according to the manufacturer's recommended protocols.

[0191] 5′ RACE primers: Primer 1 CCATAGCGTCAAACAGATGCGTGTGGTGTCC [SEQ ID NO: 18] Primer 2 GCAAAGCTTGCCAAATGGCCCAAAGTCG [SEQ ID NO: 19]

[0192] 5′RACE reaction products were separated on a 1% agarose gel. The fragments corresponding to the expected size were gel purified using an Ultrafree-DA spin column (Millipore Corporation, Bedford, Mass.) and TA cloned into TOPOXL (Invitrogen, Carlsbad, Calif.) and then used to transform TOP 10 (Invitrogen) electrocompetent cells. Colonies from the transformation reaction were used to inoculate 2 ml LB broth containing 50 μg/ml of kanamycin for plasmid minipreps (QIAprep Spin Miniprep Kit, Qiagen). Plasmids were digested with Eco RI to determine inserts of the expected size.

[0193] RACE Thermal cycling conditions:

[0194] 1) 94° C., 30 seconds

[0195] 2) 72° C., 3 minutes

[0196] 3) Repeat steps 1 and 2, 5 times

[0197] 4) 94° C., 30 seconds

[0198] 5) 70° C., 30 seconds

[0199] 6) 72° C., 30 seconds

[0200] 7) repeat steps 4 through 6, 5 times

[0201] 8) 94° C., 30 seconds

[0202] 9) 68° C., 30 seconds

[0203] 10) 72° C., 3 minutes

[0204] 11) Repeat steps 8-10, 25 times

[0205] 12) 4° C. hold

[0206] Plasmids

[0207] 4-2 generated by 5′ RACE primer 2

[0208] 7-4 generated by 5′ RACE primer 2

[0209] 7-6 generated by 5′ RACE primer 2

[0210] 6-3 generated by 5′ RACE primer 1

[0211] were sequenced. Plasmids containing overlapping sequence corresponding to the previously cloned portion of the KLH2 cDNA were sequenced.

[0212] 3′ RACE primers: 3′ RACE primer 1 GAGCGCCCTTCCACATTTTGTTG [SEQ ID NO: 20] 3′ RACE primer 2 AGCGCCCTTCCACATTTTGTTGC [SEQ ID NO: 21]

[0213] 3′RACE reaction products were separated on a 1% agarose gel. The fragments corresponding to the expected size were gel-purified using an Ultrafree-DA spin column (Amicon, Millipore) and TA cloned into TOPOXL (Invitrogen) and then used to transform TOP10 (Invitrogen) electrocompetent cells. Colonies from the transformation reaction were used to inoculate 2 ml LB broth containing 50 μg/ml of kanamycin for plasmid minipreps (QIAprep Spin Miniprep Kit) (Qiagen). Plasmids were digested with Eco RI to determine inserts of the expected size.

[0214] 3′ RACE plasmids 1-3, 1-4, and 2-5 were sequenced. The overlapping 5′ and 3′ RACE clones contained the complete cDNA sequence of KLH isotype 2 and, in addition, part of the 5′ untranslated region, including the N-terminal methionine, a leader peptide sequence for secretion of the KLH-2 protein, a 3′ untranslated region and the 3′ poly A tail.

Example 2

[0215] This example describes the cloning and sequencing of the KLH-1 cDNA.

[0216] Fresh limpet tissue associated with the respiratory system and other organ tissues were dissected and immediately frozen on dry ice. The tissue was stored at −70° C.

[0217] Two hundred mg of tissue were homogenized with a PowerGen 125 (Fisher Scientific) in 3.0 ml Trizol Reagent (Life Technologies). The sample was centrifuged at 15,000×g at 4° C. for 10 min. The sample was allowed to come to room temperature (15-30° C.) for two min. The aqueous layer was extracted twice with 0.3 ml of chloroform. The RNA was then precipitated with isopropanol. The RNA pellets were washed with 75% ethanol, dried and dissolved in distilled water. The RNA solution was stored at −70° C.

[0218] Hemocyanin sequences for Megathura crenulata found on NCBI's GenBank database were used to make primers for 5′ and 3′ RACE reactions (International Patent Application WO 01/14536, SEQ ID NO: 210, 211, and 217). 5′ and 3′ RACE reactions were performed using the SMART™ RACE cDNA Amplification Kit (Catalog K 1811-1, CLONTECH) according to the manufacturer's recommended protocols. 5′ RACE primer: Primer 1 CGGGGTGACAGTTGCTGCTATCGGTGCA [SEQ ID NO: 34] 3′ RACE primer: Primer 2 GCCTTGACCGCCCATGGTGCCAGA [SEQ ID NO: 35] PCR Primers: Primer 3 GGGGCGACCGAGTTGTATCCACTCCCA [SEQ ID NO: 36] Primer 4 GGCTGGAACATTTGCAGTGCTCGGTGG [SEQ ID NO: 37]

[0219] PCR reactions were performed with 5′ RACE-ready cDNA with primers 3 and 4 under the following thermal cycling conditions:

[0220] 1) 94° C., 30 seconds

[0221] 2) 72° C., 10 minutes

[0222] 3) Repeat steps 1 and 2, 5 times

[0223] 4) 94° C., 30 seconds

[0224] 5) 70° C., 30 seconds

[0225] 6) 72° C., 10 minutes

[0226] 7) repeat steps 4 through 6, 5 times

[0227] 8) 94° C., 30 seconds

[0228] 9) 68° C., 30 seconds

[0229] 10) 72° C., 10 minutes

[0230] 11) Repeat steps 8-10, 25 times

[0231] 12) 72° C., 10 minutes

[0232] 13) 4° C. hold

[0233] 5′ RACE thermal cycling conditions for Primer 1:

[0234] 1) 94° C., 15 seconds

[0235] 2) Gradient (69.5-72° C.) 6 minutes

[0236] 3) Repeat steps 1 and 2, 5 times

[0237] 4) 94° C., 15 seconds

[0238] 5) 70° C., 30 seconds

[0239] 6) 72° C., 6 minutes

[0240] 7) repeat steps 4 through 6, 5 times

[0241] 8) 94° C., 15 seconds

[0242] 9) 68° C., 30 seconds

[0243] 10) 72° C., 6 minutes

[0244] 11) Repeat steps 8-10, 29 times

[0245] 12) 72° C., 10 minutes

[0246] 13) 4° C. hold

[0247] 3′ RACE Thermal cycling conditions for Primer 2:

[0248] 1) 94° C., 30 seconds

[0249] 2) 70° C., 10 minutes

[0250] 3) Repeat steps 1 and 2, 5 times

[0251] 4) 94° C., 30 seconds

[0252] 5) 68° C., 30 seconds

[0253] 6) 72° C., 10 minutes

[0254] 7) repeat steps 4 through 6, 5 times

[0255] 8) 94° C., 30 seconds

[0256] 9) 66° C., 30 seconds

[0257] 10) 72° C., 10 minutes

[0258] 11) Repeat steps 8-10, 25 times

[0259] 12) 72° C., 10 minutes

[0260] 13) 4° C. hold

[0261] 5′ RACE reaction yielded a product of approximately 3500 bp. 3′ RACE reaction yielded a product of approximately 2900 bp. PCR reactions with 316A and 302B primers yielded PCR product of approximately 5600 bp. Reaction products were separated on a 1% agarose gel. The fragments corresponding to the expected size were gel-purified using an Ultrafree-DA spin column (Amicon, Millipore, Bedford, Mass.) and TA cloned into TOPOXL (Invitrogen) and then used to transform TOP10 (Invitrogen) electrocompetent cells. Colonies from the transformation reaction were used to inoculate 2 ml LB broth containing 50 μg/ml of kanamycin for plasmid minipreps (QIAprep Spin Miniprep Kit)(Qiagen). Plasmids were digested with Eco RI and Not 1 (Promega, Madison, Wis.) to determine inserts of the expected size.

[0262] Plasmids 5-4, 16-1, and 12-3 were sequenced. The overlapping 5′ and 3′ RACE clones contained the complete cDNA sequence of KLH-1 and, in addition, part of the 5′ untranslated region, including the N-terminal methionine, a leader peptide sequence for secretion of the KLH-1 protein, a 3′ untranslated region and the 3′ poly A tail.

Example 3

[0263] This example describes the construction of vectors containing full-length KLH1 and KLH2 cDNA.

[0264] RNA extraction was performed as described in Example 1.

[0265] cDNA was prepared using the SMART cDNA Synthesis Kit (Catalog K1052-1, CLONTECH) according to the manufacturer's recommended protocols. KLH2 Primer 1 (forward): 5′ to 3′ AGGCTACGCGTTCGAAGGAGATAGAACCATGTGGACCATCTTGGCTCTC [SEQ ID NO: 38] KLH2 Primer 2 (reverse): 5′ to 3′ CTGGTGCGGCCGCCTTACTCGTCTTCAATGTGGATATG [SEQ ID NO: 39] KLH1 Primer 1 (forward): 5′ to 3′ ATGCTGTCGGTCAGGTTGCTTATAGTCG [SEQ ID NO: 40] KLH1 Primer 2 (reverse): 5′ to 3′ CTATTCGTCATCCACATGGACATGTATCC [SEQ ID NO: 41]

[0266] KLH1 and KLH2 thermal cycling conditions:

[0267] 1) 95° C., 2 minutes

[0268] 2) 94° C., 20 seconds

[0269] 3) Gradient (58-70° C.) 30 seconds

[0270] 4) 72° C., 10 minutes

[0271] 5) Repeat steps 2 through 4, 29 times

[0272] 6) 72° C., 10 minutes

[0273] 7) 4° C. hold

[0274] KLH1 and KLH2 PCR reactions yielded products of approximately 10,000 bp. PCR products were separated on a 1% agarose gel. The fragments were gel-purified using an Ultrafree-DA spin column (Amicon, Millipore, Bedford, Mass.), TA cloned into pCR-XL-TOPO (Invitrogen) using the TOPOXL PCR Cloning Kit (Catalog K4700-10, Invitrogen), and used to transform TOP10 (Invitrogen) electrocompetent cells. Colonies from the transformation reaction were used to inoculate 2 mL LB broth containing 50 μg/mL of kanamycin for plasmid minipreps (QIAprep Spin Miniprep Kit, QIAGEN).

[0275] KLH1 plasmids were digested with Not1 restriction enzyme (New England Biolabs). KLH2 plasmids were digested with Mlu1 and Not1 restriction enzymes (New England Biolabs). Digests were separated on a 1% agarose gel to determine inserts of the expected size.

[0276] KLH2 plasmid 13 and KLH1 plasmid 5-1 were each sequenced. Two-three kilobases of both of the 5′ and 3′ ends of each insert were sequenced for confirmation. The plasmids were found to contain the coding regions of KLH2 and KLH1, respectively.

[0277] All of the references cited herein, including patents, patent applications, and publications, are hereby incorporated in their entireties by reference.

[0278] While this invention has been described with an emphasis upon preferred embodiments, it will be obvious to those of ordinary skill in the art that variations of the preferred embodiments may be used and that it is intended that the invention may be practiced otherwise than as specifically described herein. Accordingly, this invention includes all modifications encompassed within the spirit and scope of the invention as defined by the following claims.

1 41 1 10206 DNA Megathura crenulata 1 gtagacacag tggtgagaaa gaatgtggac agtttgagca gtgacgaggt tctagccctc 60 gagaaggctc tcgatgatct ccagcaagac gacagcaacc agggttatca ggccatagca 120 ggctaccatg gtgttccaac aatgtgtgtg gacaaacatg aaaaaaatgt ggcctgttgt 180 cttcacggta tgccatcctt tcccctttgg catcgtctct acgttgtcca gttggaaaga 240 gcgctgataa ggaagaaggc aaccatcagc atcccctact gggattggac tagtgagctc 300 acccatcttc cagaacttgt gtctcatcca ctgtttgtcg acactgaggg gggcaaggct 360 catgacaatt cttggtaccg agctgacatc actttcctca acaagaagac aagtcgtgct 420 gtagatgacc gtctttttga aaaagtacag ccaggacacc acactcgcct catggaggga 480 atccttgacg ctctggaaca ggacgagttt tgcaaatttg agatccagtt tgagctggct 540 cacaacgcca tacattatct agttgggggc agacacacat attccatgtc tcacctcgag 600 tacacgtcat acgatccttt gttcttttta catcacttca acactgaccg tatctttgct 660 atatggcaac gacttcaaca gctcagaggc aaggatccca acagtgctga ctgtgcccat 720 aaccttattc acactccaat ggaaccgttc gacagagaca ctaatccact ggatctgaca 780 agagagcacg ccaaacctgc tgatagcttt gactacgggc gcctcggcta ccagtatgac 840 gacctctcct tgaatggcat gagtccagag gaactcaacg tatatctagg agaacgagca 900 gctaaggagc gtacatttgc cagcttcata ttatctggtt ttggtggctc tgccaatgtt 960 gtagtctatg tctgccgacc tgctcatgat gaaatcgctg atgatcaatg tatcaaagca 1020 ggagatttct tccttcttgg cggaccaact gaaatgaagt ggggattcta cagggcatac 1080 cactttgatg ttactgacag cgttgcttct atagatgatg atggtcatgg gcattattac 1140 gtgaagtccg agttattcag cgtcaacgga agcgctttgt caaatgacat tcttcgacaa 1200 cccaccttag tgcaccgacc agctaaaggt cattttgata agcctccagt tccagttgct 1260 caagctaacc tggctgtcag aaagaatatc aacgatctaa ctgcagagga aacatacagc 1320 ttgaggaaag ctatggaaag attccagaat gactcagttg acgggtacca agctacagta 1380 gaattccatg cccttccagc aagatgtcca cgcccagatg ctaaagatag atttgcatgc 1440 tgcgttcatg gcatggcaac cttcccccac tggcatagac tctttgtcac tcaagttgaa 1500 gatgctctac ttcgtcgcgg gtctactatc ggtctacctt actgggactg gaccatgcca 1560 atggatcatc tgccagaact ggctatgagt gagacctatc cccatccagt ttctagcgca 1620 atcatggcaa atccatttta ccatgccgaa gtggcttttg aaaataacgt cacacacaga 1680 gatacagaca atagactctt tgagaaaccg tcttttggac accacacgca tctgtttgac 1740 gctatggtct atgcttttga acaagaagac ttttgcgact ttgaagttca gtttgaactc 1800 gttcacaata acatacatgc gtgggttgga ggaagtgaaa agtactccat gtcttctctt 1860 cactacactg cttttgatcc tattttctac ctccatcact caaatgttga tcgtctctgg 1920 gccatttggc aagctcttca aatcaggagg aacaaacctt acaaggccca ttgtgcctgg 1980 tcacaggaaa gggaaccatt aaagcctttt tcattcagtt cacctctcaa caacaacgat 2040 aaaacctacc aaaactctgt tccaactaat gtttacgact atgagggagt tttggactat 2100 cgatatgatg acctccagtt tggtggcatg accctgtcag aacttgaaaa atatatacac 2160 aagcagaaac agcatgacag aagttttgca ggattcttcc tttctcatat tggaacatca 2220 gcaaacgtag acgtccttat cagtgcaaag ggccaagagg cccacacgat tggaagcttt 2280 gcagtgcttg gtggatctaa ggaaatgaaa tgggctttcg acagaatgta caaatatgaa 2340 attactgatg ctgccaaaaa gctgaacatt gatgtgaatg atgccactat ttctgttaag 2400 atcactgatg ttgatggaac ttccttgtca ccagagctaa ttccgtctcc tgctctcatg 2460 tttgaacctg ggcatattgt tgccaaagac tttggccaca gcaaaaaaat caggaagaac 2520 gttcattccc tgacagctga agaacaaaat tcgttgaggc gagctatggc agatctccaa 2580 gatgacaaaa cacgaggtgg attccaacag attgcagcat ttcacggaga accaaaatgg 2640 tgcccacgcc ctgaagcgga gaagaaattt gcatgctgcg ttcacggaat ggctgttttc 2700 cctcactggc acagattgct gacagttcaa ggagaaaatg ctctgaggaa acatggattt 2760 actggtggat tgccctattg ggactggact cgaccaatga gcgcccttcc acattttgtt 2820 gctgatccta cttacaatga ttctgtttcc agcctcgaag aagataaccc atggtatcat 2880 ggtcacatag attctgttgg gcatgatact acaagagctg tgcgtgatga tctttatcaa 2940 tctcctggtt tcggtcacta cacagatatt gcaaaacaag tccttctggc ctttgagcag 3000 gacgatttct gtgattttga ggtacaattt gaaattgccc ataatttcat acatgctcta 3060 gttggtggta acgaaccata cagtatgtca tctttgaggt atactacata cgatccaatc 3120 ttcttcttgc accactccaa tacagaccga ctttgggcca tttggcaagc tttgcaaaaa 3180 taccggggga aaccatacaa cactgcaaac tgtgccattg catccatgag aaaaccactt 3240 cagccatttg gtcttgatag tgtcataaat ccagatgacg aaactcgtga acattcggtt 3300 cctttccgag tcttcgacta caagaacaac ttcgactatg agtatgagag cctggcattt 3360 aatggtctgt ctattgccca actggaccga gagttgcaga gaagaaagtc acatgacaga 3420 gtctttgcag gattccttct tcatgaaatt ggacagtctg cactcgtgaa attctacgtt 3480 tgcaaacaca atgtatctga ctgtgaccat tatgctggag aattctacat tttgggagat 3540 gaagctgaga tgccttggag gtatgaccgt gtgtacaagt acgagataac acagcagctg 3600 cacgatttag atctacatgt tggagataat ttctttctta aatatgaagc ctttgatctg 3660 aatggcggaa gtcttggtgg aagtatcttt tctcagcctt cggtgatttt cgagccaact 3720 gcaggttcac accaggctga tgaatatcgt gaggcagtaa caagcgctag ccacataaga 3780 aaaaatatcc gggacctctc agagggagaa attgagagca tcagatctgc tttcctccaa 3840 attcaaaaag agggtatata tgaaaacatt gcaaagttcc atggaaaacc aggactttgt 3900 gaatatgatg gacatcctgt tgcttgttgt gtccatggca tgcccacctt tccccactgg 3960 cacagactgt acgttcttca ggtggagaat gcgctcttag aacgagggtc tgcagttgct 4020 gttccttact gggactggac cgagaaagct gactctctgc catcattaat caatgatgca 4080 acttatttca attcacgatc ccagaccttt gatcctaatc ctttcttcag gggacatatc 4140 gccttcgaga atgctgtgac gtccagagat cctcagccag aactatggga caataaggac 4200 ttctacgaga atgtcatgct ggctcttgag caagacaact tctgtgactt tgagattcag 4260 cttgagctga tacacaacgc ccttcattct agacttggag gaagggctaa atactccctt 4320 tcgtctcttg attataccgc atttgatcct gtatttttcc ttcaccatgc aaacgttgac 4380 agaatctggg ccatctggca ggacttgcag agatatagaa agaaaccata caatgaggct 4440 gactgcgcag tcaacgagat gcgtaaacct cttcaaccat ttaataaccc agaacttaac 4500 agtgattcca tgacgcttaa acacaacctc ccacaagaca gttttgatta tcaaaaccgc 4560 ttcaggtacc aatatgataa ccttcaattt aaccacttca gcatacaaaa gctagaccaa 4620 actattcagg ctagaaaaca acacgacaga gtttttgctg gctttattct tcacaacatt 4680 gggacatctg ttgttgtaga tatttatatt tgcgttgaac aaggaggaga acaaaactgc 4740 aagacaaagg cgggttcctt cacgattctg gggggagaaa cagaaatgcc attccacttt 4800 gaccgcttgt acaaatttga cataacgtct gctctgcata aacttggtgt tcccttggac 4860 ggacatggat tcgacatcaa agttgacgtc agagctgtca atggatcgca tcttgatcca 4920 cacatcctca acgaaccgag tctgcttttt gttcctggtg aacattcaca tcaaactgca 4980 gatgggcttt cacaacataa tcttgtgcga aaagaagtaa gctctcttac aacactggag 5040 aaacattttt tgaggaaagc tctcaagaac atgcaagcag atgattctcc agacggatat 5100 caagctattg cttctttcca cgctttgcct cctctttgtc caagtccatc tgctgcacat 5160 agacacgctt gttgcctcca tggtatggct accttccctc agtggcacag actctacaca 5220 gttcagttcg aagattcttt gaaacgacat ggttctattg tcggacttcc atattgggat 5280 tggctgaaac cgcagtctgc actccctgat ttggtgacac aggagacata cgagcacctg 5340 ttttcacaca aaaccttccc aaatccgttc ctcaaggcaa atatagaatt tgagggagag 5400 ggagtaacaa cagagaggga tgttgatgct gaacacctct ttgcaaaagg aaatctggtt 5460 tacaacaact ggttttacaa tcaggcacta tatgcactag aacaagaaaa ttactgtgac 5520 tttgaaatac agttcgaaat tttgcataat ggaattcatt catgggttgg aggatcaaag 5580 acccattcaa taggtcatct tcattacgca tcatacgatc cactgttcta tatccaccat 5640 tcgcagacag atcgcatttg ggctatctgg caagctttcc aggagcacag aggtctttca 5700 gggaaggaag cacactgcgc cctggagcaa atgaaagacc ctctcaaacc tttcagcttt 5760 ggaagtccct ataatttgaa caaacgcact caagagttct ccaagcctga agacacattt 5820 gattatcacc gattcgggta tgagtatgat tccctcgaat ttgttggcat gtctgtttca 5880 agtttacata actatataaa acaacaacag gaagctgata gagtcttcgc aggattcctt 5940 cttaaaggat ttggacaatc agcatccgta tcgtttgata tctgcagacc agaccagagt 6000 tgccaagaag ctggatactt ctcagttctc ggtggaagtt cagaaatgcc gtggcagttt 6060 gacaggcttt acaagtacga cattacaaaa acgttgaaag acatgaaact gcgatacgat 6120 gacgcattta ccatcaaggt tcacataaag gatatagctg gagctgagtt ggacagcgat 6180 ctgattccaa ctccttctgt tctctttgaa gaaggaaagc atgggatcaa tgtacgtcac 6240 gttggtcgta atcggattcg tatggaacta tctgaactca ccgagagaga tctcgccagc 6300 ctgaaatctg caatgaggtc tctacaagct gacgatgggg tgaacggtta tcaagccatt 6360 gcatcattcc acggtctccc ggcttcttgt catgatgatg agggacatga gattgcctgt 6420 tgtatccacg gaatgccagt attcccacac tggcacaggc tttacaccct gcaaatggac 6480 atggctctgt tatctcacgg atctgctgtt gctattccat actgggactg gaccaaacct 6540 atcagcaaac tgcctgatct cttcaccagc cctgaatatt acgatccttg gagggatgca 6600 gttgtcaata atccatttgc taaaggctac attaaatccg aggacgctta cacggttagg 6660 gatcctcagg acattttgta ccacttgcag gacgaaacgg gaacatctgt tttgttagat 6720 caaactcttt tagccttaga acagacagat ttctgtgatt ttgaggttca atttgaggtc 6780 gtccataatg ctattcacta cttggtgggt ggtcgacaag tttatgctct ttcttctcta 6840 cactatgctt catatgaccc agccttcttt attcatcact cctttgttga caaaatatgg 6900 gcagtctggc aagctctgca aaagaagaga aagcgtccct atcataaagc ggattgtgct 6960 cttaacatga tgaccaaacc aatgcgacca tttgcacacg atttcaatca caatggattc 7020 acaaaaatgc acgcagtccc caacactcta tttgactttc aggacctttt ctacacgtat 7080 gacaacttag aaattgctgg catgaatgtt aatcagttgg aagcggaaat caaccggcga 7140 aaaagccaaa caagagtctt tgccgggttc cttctacatg gcattggaag atcagctgat 7200 gtacgatttt ggatttgcaa gacagctgac gactgccacg catctggcat gatctttatc 7260 ttaggaggtt ctaaagagat gcactgggcc tatgacagga actttaaata cgacatcacc 7320 caagctttga aggctcagtc catacaccct gaagatgtgt ttgacactga tgctcctttc 7380 ttcattaaag tggagatcca tggtgtaaac aagactgctc tcccatcttc agctatccca 7440 gcacctacta taatctactc agctggtgaa ggtcataccg atgatcacgg ctcagatcat 7500 attgctggca gtggagtcag gaaagacgtg acgtctctta ccgcatctga gatagagaac 7560 ctgaggcatg ctctgcaaag cgtgatggat gatgatggac ccaatggatt ccaggcaatt 7620 gctgcttatc acggaagtcc tcccatgtgt cacatgcctg atggtagaga cgttgcatgt 7680 tgtactcatg gaatggcatc tttccctcac tggcacagac tgtttgtgaa acagatggag 7740 gatgcactgg ctgcgcatgg agctcacatt ggcataccat actgggattg gacaagtgcg 7800 tttagtcatc tgcctgccct agtgactgac cacgagcaca atcccttcca ccacggacat 7860 attgctcatc ggaatgtgga tacatctcga tctccgagag acatgctgtt caatgaccct 7920 gaacacgggt cagaatcatt cttctataga caggttctct tggctctaga acagacagac 7980 ttctgccaat ttgaagttca gtttgaaata acacacaatg caatccactc ttggactgga 8040 ggacatactc catatggaat gtcatcactg gaatatacag catatgatcc actcttttat 8100 ctccaccatt ccaacactga tcgtatctgg gccatctggc aggcactcca gaaatacaga 8160 ggttttcaat acaacgcagc tcattgcgat atccaggttc tgaaacaacc tcttaaacca 8220 ttcagcgagt ccaggaatcc aaacccagtc accagagcca attctagggc agtcgattca 8280 tttgattatg agagactcaa ttatcaatat gacacactta ccttccacgg acattctatc 8340 tcagaacttg atgccatgct tcaagagaga aagaaggaag agagaacatt tgcagccttc 8400 ctgttgcacg gatttggcgc cagtgctgat gtttcgtttg atgtctgcac acctgatggt 8460 cattgtgcct ttgctggaac cttcgcggta cttggtgggg agctggagat gccctggtcc 8520 tttgaaagat tgttccgtta tgatatcaca aaggttctca agcagatgaa tctttactat 8580 gattctgagt ttcatttcgc gttgaaggtt gttggcacag atggaacaga actgccatcg 8640 gatagtatca agagcccaac aattgagcat catcaagcag atcaccaggg tcatggtggg 8700 gagggtggcc aacatcctga acatactgga tttgtaagga aggacataga ttctctttcc 8760 ttggatgaag ccaatgatct gaaaaatgca ctgtacaagc tgcagaacga ccaaagcctc 8820 agtggatatg aatcaatagc tggtttccat ggctatccct acctctgtcc tgacaacggc 8880 gatgacaagt atgcctgctg tgtccatgga atgcctgtat ttccacactg gcacagactt 8940 cttactatcc agtttgagag agcactcaaa gaacatggtt cacatttggg tgttccctat 9000 tgggattgga ccaagtcaat gactgctctt ccagctttct ttgccgattc ttccaataac 9060 aaccctttct acaaatatca cattatgaaa gctggacatg atacaacgag acagccgaat 9120 gatcttctct tcaatcaacc acaggttcat ggatatgact atctatacta ccttgccctt 9180 tcaacactag aagaggataa ctactgtgac tttgaggttc agtatgaaaa ttttcacaat 9240 gcagtacata tctggctcgg gggaactgaa acctactcca tggctactct tgaatattca 9300 gcgtttgatc cagtctttat gatcttgcat tctggtctcg acagactctg gatcatttgg 9360 caaaagctgc aaaagctcag aaagaaacca tataatgcag ccaaatgtgc tggccacatg 9420 attgatgaac ctttgcatcc ttttgactac gagtctgtta accaagatgg cttcactcga 9480 gcaaacgcta aacctacaac agtgtttgac agtcacaggt ttgattatca ctatgacaac 9540 ctggatgtac gaggaaagag tattcaggaa ataaacgcca tcatccacga tttgaggacc 9600 caaccaagaa tctttgctgg ttttgttctt tctggaatat acacttcagc taatgttaag 9660 atatacttaa tcagagaagg ccatgacgat gaatatgtag gaacatttgc agttttagga 9720 ggagccaagg agatgccatg ggcctatgag aggattttca agtatgacat tacagatgtc 9780 gcccacaaac tgaaaatgca ccatgatgat acattcaact tcagacttga aattcaatct 9840 tacagtggcg aaacactaac atcacatctt cctgaacctc tcataatcca cagaccagcc 9900 aacaaggact atgatgtctt gattattcca ttaggaagcg gacgcaagct tcctcctaaa 9960 gtcattgtca agagaggaac agaaattgag tttcatccag tcgatgacac gatcaacaga 10020 cctgttgtag atcttggcag ctacactgct ctctacaact gtgtggtgcc acccttcaca 10080 taccacggat atgagctcga tcatgcctac tcccttaggg atgggcacta ctatattgca 10140 ggcccaacca aagacctctg cgccagcgga aatgttagaa tacatatcca cattgaagac 10200 gagtaa 10206 2 10206 DNA Megathura crenulata SEQ ID NO2 is the complementary sequence of the coding sequence of SEQ ID NO1 2 ttactcgtct tcaatgtgga tatgtattct aacatttccg ctggcgcaga ggtctttggt 60 tgggcctgca atatagtagt gcccatccct aagggagtag gcatgatcga gctcatatcc 120 gtggtatgtg aagggtggca ccacacagtt gtagagagca gtgtagctgc caagatctac 180 aacaggtctg ttgatcgtgt catcgactgg atgaaactca atttctgttc ctctcttgac 240 aatgacttta ggaggaagct tgcgtccgct tcctaatgga ataatcaaga catcatagtc 300 cttgttggct ggtctgtgga ttatgagagg ttcaggaaga tgtgatgtta gtgtttcgcc 360 actgtaagat tgaatttcaa gtctgaagtt gaatgtatca tcatggtgca ttttcagttt 420 gtgggcgaca tctgtaatgt catacttgaa aatcctctca taggcccatg gcatctcctt 480 ggctcctcct aaaactgcaa atgttcctac atattcatcg tcatggcctt ctctgattaa 540 gtatatctta acattagctg aagtgtatat tccagaaaga acaaaaccag caaagattct 600 tggttgggtc ctcaaatcgt ggatgatggc gtttatttcc tgaatactct ttcctcgtac 660 atccaggttg tcatagtgat aatcaaacct gtgactgtca aacactgttg taggtttagc 720 gtttgctcga gtgaagccat cttggttaac agactcgtag tcaaaaggat gcaaaggttc 780 atcaatcatg tggccagcac atttggctgc attatatggt ttctttctga gcttttgcag 840 cttttgccaa atgatccaga gtctgtcgag accagaatgc aagatcataa agactggatc 900 aaacgctgaa tattcaagag tagccatgga gtaggtttca gttcccccga gccagatatg 960 tactgcattg tgaaaatttt catactgaac ctcaaagtca cagtagttat cctcttctag 1020 tgttgaaagg gcaaggtagt atagatagtc atatccatga acctgtggtt gattgaagag 1080 aagatcattc ggctgtctcg ttgtatcatg tccagctttc ataatgtgat atttgtagaa 1140 agggttgtta ttggaagaat cggcaaagaa agctggaaga gcagtcattg acttggtcca 1200 atcccaatag ggaacaccca aatgtgaacc atgttctttg agtgctctct caaactggat 1260 agtaagaagt ctgtgccagt gtggaaatac aggcattcca tggacacagc aggcatactt 1320 gtcatcgccg ttgtcaggac agaggtaggg atagccatgg aaaccagcta ttgattcata 1380 tccactgagg ctttggtcgt tctgcagctt gtacagtgca tttttcagat cattggcttc 1440 atccaaggaa agagaatcta tgtccttcct tacaaatcca gtatgttcag gatgttggcc 1500 accctcccca ccatgaccct ggtgatctgc ttgatgatgc tcaattgttg ggctcttgat 1560 actatccgat ggcagttctg ttccatctgt gccaacaacc ttcaacgcga aatgaaactc 1620 agaatcatag taaagattca tctgcttgag aacctttgtg atatcataac ggaacaatct 1680 ttcaaaggac cagggcatct ccagctcccc accaagtacc gcgaaggttc cagcaaaggc 1740 acaatgacca tcaggtgtgc agacatcaaa cgaaacatca gcactggcgc caaatccgtg 1800 caacaggaag gctgcaaatg ttctctcttc cttctttctc tcttgaagca tggcatcaag 1860 ttctgagata gaatgtccgt ggaaggtaag tgtgtcatat tgataattga gtctctcata 1920 atcaaatgaa tcgactgccc tagaattggc tctggtgact gggtttggat tcctggactc 1980 gctgaatggt ttaagaggtt gtttcagaac ctggatatcg caatgagctg cgttgtattg 2040 aaaacctctg tatttctgga gtgcctgcca gatggcccag atacgatcag tgttggaatg 2100 gtggagataa aagagtggat catatgctgt atattccagt gatgacattc catatggagt 2160 atgtcctcca gtccaagagt ggattgcatt gtgtgttatt tcaaactgaa cttcaaattg 2220 gcagaagtct gtctgttcta gagccaagag aacctgtcta tagaagaatg attctgaccc 2280 gtgttcaggg tcattgaaca gcatgtctct cggagatcga gatgtatcca cattccgatg 2340 agcaatatgt ccgtggtgga agggattgtg ctcgtggtca gtcactaggg caggcagatg 2400 actaaacgca cttgtccaat cccagtatgg tatgccaatg tgagctccat gcgcagccag 2460 tgcatcctcc atctgtttca caaacagtct gtgccagtga gggaaagatg ccattccatg 2520 agtacaacat gcaacgtctc taccatcagg catgtgacac atgggaggac ttccgtgata 2580 agcagcaatt gcctggaatc cattgggtcc atcatcatcc atcacgcttt gcagagcatg 2640 cctcaggttc tctatctcag atgcggtaag agacgtcacg tctttcctga ctccactgcc 2700 agcaatatga tctgagccgt gatcatcggt atgaccttca ccagctgagt agattatagt 2760 aggtgctggg atagctgaag atgggagagc agtcttgttt acaccatgga tctccacttt 2820 aatgaagaaa ggagcatcag tgtcaaacac atcttcaggg tgtatggact gagccttcaa 2880 agcttgggtg atgtcgtatt taaagttcct gtcataggcc cagtgcatct ctttagaacc 2940 tcctaagata aagatcatgc cagatgcgtg gcagtcgtca gctgtcttgc aaatccaaaa 3000 tcgtacatca gctgatcttc caatgccatg tagaaggaac ccggcaaaga ctcttgtttg 3060 gctttttcgc cggttgattt ccgcttccaa ctgattaaca ttcatgccag caatttctaa 3120 gttgtcatac gtgtagaaaa ggtcctgaaa gtcaaataga gtgttgggga ctgcgtgcat 3180 ttttgtgaat ccattgtgat tgaaatcgtg tgcaaatggt cgcattggtt tggtcatcat 3240 gttaagagca caatccgctt tatgataggg acgctttctc ttcttttgca gagcttgcca 3300 gactgcccat attttgtcaa caaaggagtg atgaataaag aaggctgggt catatgaagc 3360 atagtgtaga gaagaaagag cataaacttg tcgaccaccc accaagtagt gaatagcatt 3420 atggacgacc tcaaattgaa cctcaaaatc acagaaatct gtctgttcta aggctaaaag 3480 agtttgatct aacaaaacag atgttcccgt ttcgtcctgc aagtggtaca aaatgtcctg 3540 aggatcccta accgtgtaag cgtcctcgga tttaatgtag cctttagcaa atggattatt 3600 gacaactgca tccctccaag gatcgtaata ttcagggctg gtgaagagat caggcagttt 3660 gctgataggt ttggtccagt cccagtatgg aatagcaaca gcagatccgt gagataacag 3720 agccatgtcc atttgcaggg tgtaaagcct gtgccagtgt gggaatactg gcattccgtg 3780 gatacaacag gcaatctcat gtccctcatc atcatgacaa gaagccggga gaccgtggaa 3840 tgatgcaatg gcttgataac cgttcacccc atcgtcagct tgtagagacc tcattgcaga 3900 tttcaggctg gcgagatctc tctcggtgag ttcagatagt tccatacgaa tccgattacg 3960 accaacgtga cgtacattga tcccatgctt tccttcttca aagagaacag aaggagttgg 4020 aatcagatcg ctgtccaact cagctccagc tatatccttt atgtgaacct tgatggtaaa 4080 tgcgtcatcg tatcgcagtt tcatgtcttt caacgttttt gtaatgtcgt acttgtaaag 4140 cctgtcaaac tgccacggca tttctgaact tccaccgaga actgagaagt atccagcttc 4200 ttggcaactc tggtctggtc tgcagatatc aaacgatacg gatgctgatt gtccaaatcc 4260 tttaagaagg aatcctgcga agactctatc agcttcctgt tgttgtttta tatagttatg 4320 taaacttgaa acagacatgc caacaaattc gagggaatca tactcatacc cgaatcggtg 4380 ataatcaaat gtgtcttcag gcttggagaa ctcttgagtg cgtttgttca aattataggg 4440 acttccaaag ctgaaaggtt tgagagggtc tttcatttgc tccagggcgc agtgtgcttc 4500 cttccctgaa agacctctgt gctcctggaa agcttgccag atagcccaaa tgcgatctgt 4560 ctgcgaatgg tggatataga acagtggatc gtatgatgcg taatgaagat gacctattga 4620 atgggtcttt gatcctccaa cccatgaatg aattccatta tgcaaaattt cgaactgtat 4680 ttcaaagtca cagtaatttt cttgttctag tgcatatagt gcctgattgt aaaaccagtt 4740 gttgtaaacc agatttcctt ttgcaaagag gtgttcagca tcaacatccc tctctgttgt 4800 tactccctct ccctcaaatt ctatatttgc cttgaggaac ggatttggga aggttttgtg 4860 tgaaaacagg tgctcgtatg tctcctgtgt caccaaatca gggagtgcag actgcggttt 4920 cagccaatcc caatatggaa gtccgacaat agaaccatgt cgtttcaaag aatcttcgaa 4980 ctgaactgtg tagagtctgt gccactgagg gaaggtagcc ataccatgga ggcaacaagc 5040 gtgtctatgt gcagcagatg gacttggaca aagaggaggc aaagcgtgga aagaagcaat 5100 agcttgatat ccgtctggag aatcatctgc ttgcatgttc ttgagagctt tcctcaaaaa 5160 atgtttctcc agtgttgtaa gagagcttac ttcttttcgc acaagattat gttgtgaaag 5220 cccatctgca gtttgatgtg aatgttcacc aggaacaaaa agcagactcg gttcgttgag 5280 gatgtgtgga tcaagatgcg atccattgac agctctgacg tcaactttga tgtcgaatcc 5340 atgtccgtcc aagggaacac caagtttatg cagagcagac gttatgtcaa atttgtacaa 5400 gcggtcaaag tggaatggca tttctgtttc tccccccaga atcgtgaagg aacccgcctt 5460 tgtcttgcag ttttgttctc ctccttgttc aacgcaaata taaatatcta caacaacaga 5520 tgtcccaatg ttgtgaagaa taaagccagc aaaaactctg tcgtgttgtt ttctagcctg 5580 aatagtttgg tctagctttt gtatgctgaa gtggttaaat tgaaggttat catattggta 5640 cctgaagcgg ttttgataat caaaactgtc ttgtgggagg ttgtgtttaa gcgtcatgga 5700 atcactgtta agttctgggt tattaaatgg ttgaagaggt ttacgcatct cgttgactgc 5760 gcagtcagcc tcattgtatg gtttctttct atatctctgc aagtcctgcc agatggccca 5820 gattctgtca acgtttgcat ggtgaaggaa aaatacagga tcaaatgcgg tataatcaag 5880 agacgaaagg gagtatttag cccttcctcc aagtctagaa tgaagggcgt tgtgtatcag 5940 ctcaagctga atctcaaagt cacagaagtt gtcttgctca agagccagca tgacattctc 6000 gtagaagtcc ttattgtccc atagttctgg ctgaggatct ctggacgtca cagcattctc 6060 gaaggcgata tgtcccctga agaaaggatt aggatcaaag gtctgggatc gtgaattgaa 6120 ataagttgca tcattgatta atgatggcag agagtcagct ttctcggtcc agtcccagta 6180 aggaacagca actgcagacc ctcgttctaa gagcgcattc tccacctgaa gaacgtacag 6240 tctgtgccag tggggaaagg tgggcatgcc atggacacaa caagcaacag gatgtccatc 6300 atattcacaa agtcctggtt ttccatggaa ctttgcaatg ttttcatata taccctcttt 6360 ttgaatttgg aggaaagcag atctgatgct ctcaatttct ccctctgaga ggtcccggat 6420 attttttctt atgtggctag cgcttgttac tgcctcacga tattcatcag cctggtgtga 6480 acctgcagtt ggctcgaaaa tcaccgaagg ctgagaaaag atacttccac caagacttcc 6540 gccattcaga tcaaaggctt catatttaag aaagaaatta tctccaacat gtagatctaa 6600 atcgtgcagc tgctgtgtta tctcgtactt gtacacacgg tcatacctcc aaggcatctc 6660 agcttcatct cccaaaatgt agaattctcc agcataatgg tcacagtcag atacattgtg 6720 tttgcaaacg tagaatttca cgagtgcaga ctgtccaatt tcatgaagaa ggaatcctgc 6780 aaagactctg tcatgtgact ttcttctctg caactctcgg tccagttggg caatagacag 6840 accattaaat gccaggctct catactcata gtcgaagttg ttcttgtagt cgaagactcg 6900 gaaaggaacc gaatgttcac gagtttcgtc atctggattt atgacactat caagaccaaa 6960 tggctgaagt ggttttctca tggatgcaat ggcacagttt gcagtgttgt atggtttccc 7020 ccggtatttt tgcaaagctt gccaaatggc ccaaagtcgg tctgtattgg agtggtgcaa 7080 gaagaagatt ggatcgtatg tagtatacct caaagatgac atactgtatg gttcgttacc 7140 accaactaga gcatgtatga aattatgggc aatttcaaat tgtacctcaa aatcacagaa 7200 atcgtcctgc tcaaaggcca gaaggacttg ttttgcaata tctgtgtagt gaccgaaacc 7260 aggagattga taaagatcat cacgcacagc tcttgtagta tcatgcccaa cagaatctat 7320 gtgaccatga taccatgggt tatcttcttc gaggctggaa acagaatcat tgtaagtagg 7380 atcagcaaca aaatgtggaa gggcgctcat tggtcgagtc cagtcccaat agggcaatcc 7440 accagtaaat ccatgtttcc tcagagcatt ttctccttga actgtcagca atctgtgcca 7500 gtgagggaaa acagccattc cgtgaacgca gcatgcaaat ttcttctccg cttcagggcg 7560 tgggcaccat tttggttctc cgtgaaatgc tgcaatctgt tggaatccac ctcgtgtttt 7620 gtcatcttgg agatctgcca tagctcgcct caacgaattt tgttcttcag ctgtcaggga 7680 atgaacgttc ttcctgattt ttttgctgtg gccaaagtct ttggcaacaa tatgcccagg 7740 ttcaaacatg agagcaggag acggaattag ctctggtgac aaggaagttc catcaacatc 7800 agtgatctta acagaaatag tggcatcatt cacatcaatg ttcagctttt tggcagcatc 7860 agtaatttca tatttgtaca ttctgtcgaa agcccatttc atttccttag atccaccaag 7920 cactgcaaag cttccaatcg tgtgggcctc ttggcccttt gcactgataa ggacgtctac 7980 gtttgctgat gttccaatat gagaaaggaa gaatcctgca aaacttctgt catgctgttt 8040 ctgcttgtgt atatattttt caagttctga cagggtcatg ccaccaaact ggaggtcatc 8100 atatcgatag tccaaaactc cctcatagtc gtaaacatta gttggaacag agttttggta 8160 ggttttatcg ttgttgttga gaggtgaact gaatgaaaaa ggctttaatg gttccctttc 8220 ctgtgaccag gcacaatggg ccttgtaagg tttgttcctc ctgatttgaa gagcttgcca 8280 aatggcccag agacgatcaa catttgagtg atggaggtag aaaataggat caaaagcagt 8340 gtagtgaaga gaagacatgg agtacttttc acttcctcca acccacgcat gtatgttatt 8400 gtgaacgagt tcaaactgaa cttcaaagtc gcaaaagtct tcttgttcaa aagcatagac 8460 catagcgtca aacagatgcg tgtggtgtcc aaaagacggt ttctcaaaga gtctattgtc 8520 tgtatctctg tgtgtgacgt tattttcaaa agccacttcg gcatggtaaa atggatttgc 8580 catgattgcg ctagaaactg gatggggata ggtctcactc atagccagtt ctggcagatg 8640 atccattggc atggtccagt cccagtaagg tagaccgata gtagacccgc gacgaagtag 8700 agcatcttca acttgagtga caaagagtct atgccagtgg gggaaggttg ccatgccatg 8760 aacgcagcat gcaaatctat ctttagcatc tgggcgtgga catcttgctg gaagggcatg 8820 gaattctact gtagcttggt acccgtcaac tgagtcattc tggaatcttt ccatagcttt 8880 cctcaagctg tatgtttcct ctgcagttag atcgttgata ttctttctga cagccaggtt 8940 agcttgagca actggaactg gaggcttatc aaaatgacct ttagctggtc ggtgcactaa 9000 ggtgggttgt cgaagaatgt catttgacaa agcgcttccg ttgacgctga ataactcgga 9060 cttcacgtaa taatgcccat gaccatcatc atctatagaa gcaacgctgt cagtaacatc 9120 aaagtggtat gccctgtaga atccccactt catttcagtt ggtccgccaa gaaggaagaa 9180 atctcctgct ttgatacatt gatcatcagc gatttcatca tgagcaggtc ggcagacata 9240 gactacaaca ttggcagagc caccaaaacc agataatatg aagctggcaa atgtacgctc 9300 cttagctgct cgttctccta gatatacgtt gagttcctct ggactcatgc cattcaagga 9360 gaggtcgtca tactggtagc cgaggcgccc gtagtcaaag ctatcagcag gtttggcgtg 9420 ctctcttgtc agatccagtg gattagtgtc tctgtcgaac ggttccattg gagtgtgaat 9480 aaggttatgg gcacagtcag cactgttggg atccttgcct ctgagctgtt gaagtcgttg 9540 ccatatagca aagatacggt cagtgttgaa gtgatgtaaa aagaacaaag gatcgtatga 9600 cgtgtactcg aggtgagaca tggaatatgt gtgtctgccc ccaactagat aatgtatggc 9660 gttgtgagcc agctcaaact ggatctcaaa tttgcaaaac tcgtcctgtt ccagagcgtc 9720 aaggattccc tccatgaggc gagtgtggtg tcctggctgt actttttcaa aaagacggtc 9780 atctacagca cgacttgtct tcttgttgag gaaagtgatg tcagctcggt accaagaatt 9840 gtcatgagcc ttgcccccct cagtgtcgac aaacagtgga tgagacacaa gttctggaag 9900 atgggtgagc tcactagtcc aatcccagta ggggatgctg atggttgcct tcttccttat 9960 cagcgctctt tccaactgga caacgtagag acgatgccaa aggggaaagg atggcatacc 10020 gtgaagacaa caggccacat ttttttcatg tttgtccaca cacattgttg gaacaccatg 10080 gtagcctgct atggcctgat aaccctggtt gctgtcgtct tgctggagat catcgagagc 10140 cttctcgagg gctagaacct cgtcactgct caaactgtcc acattctttc tcaccactgt 10200 gtctac 10206 3 3401 PRT Megathura crenulata SEQ ID NO3 is the deduced amino acid sequence of SEQ ID NO1 3 Val Asp Thr Val Val Arg Lys Asn Val Asp Ser Leu Ser Ser Asp Glu 1 5 10 15 Val Leu Ala Leu Glu Lys Ala Leu Asp Asp Leu Gln Gln Asp Asp Ser 20 25 30 Asn Gln Gly Tyr Gln Ala Ile Ala Gly Tyr His Gly Val Pro Thr Met 35 40 45 Cys Val Asp Lys His Glu Lys Asn Val Ala Cys Cys Leu His Gly Met 50 55 60 Pro Ser Phe Pro Leu Trp His Arg Leu Tyr Val Val Gln Leu Glu Arg 65 70 75 80 Ala Leu Ile Arg Lys Lys Ala Thr Ile Ser Ile Pro Tyr Trp Asp Trp 85 90 95 Thr Ser Glu Leu Thr His Leu Pro Glu Leu Val Ser His Pro Leu Phe 100 105 110 Val Asp Thr Glu Gly Gly Lys Ala His Asp Asn Ser Trp Tyr Arg Ala 115 120 125 Asp Ile Thr Phe Leu Asn Lys Lys Thr Ser Arg Ala Val Asp Asp Arg 130 135 140 Leu Phe Glu Lys Val Gln Pro Gly His His Thr Arg Leu Met Glu Gly 145 150 155 160 Ile Leu Asp Ala Leu Glu Gln Asp Glu Phe Cys Lys Phe Glu Ile Gln 165 170 175 Phe Glu Leu Ala His Asn Ala Ile His Tyr Leu Val Gly Gly Arg His 180 185 190 Thr Tyr Ser Met Ser His Leu Glu Tyr Thr Ser Tyr Asp Pro Leu Phe 195 200 205 Phe Leu His His Phe Asn Thr Asp Arg Ile Phe Ala Ile Trp Gln Arg 210 215 220 Leu Gln Gln Leu Arg Gly Lys Asp Pro Asn Ser Ala Asp Cys Ala His 225 230 235 240 Asn Leu Ile His Thr Pro Met Glu Pro Phe Asp Arg Asp Thr Asn Pro 245 250 255 Leu Asp Leu Thr Arg Glu His Ala Lys Pro Ala Asp Ser Phe Asp Tyr 260 265 270 Gly Arg Leu Gly Tyr Gln Tyr Asp Asp Leu Ser Leu Asn Gly Met Ser 275 280 285 Pro Glu Glu Leu Asn Val Tyr Leu Gly Glu Arg Ala Ala Lys Glu Arg 290 295 300 Thr Phe Ala Ser Phe Ile Leu Ser Gly Phe Gly Gly Ser Ala Asn Val 305 310 315 320 Val Val Tyr Val Cys Arg Pro Ala His Asp Glu Ile Ala Asp Asp Gln 325 330 335 Cys Ile Lys Ala Gly Asp Phe Phe Leu Leu Gly Gly Pro Thr Glu Met 340 345 350 Lys Trp Gly Phe Tyr Arg Ala Tyr His Phe Asp Val Thr Asp Ser Val 355 360 365 Ala Ser Ile Asp Asp Asp Gly His Gly His Tyr Tyr Val Lys Ser Glu 370 375 380 Leu Phe Ser Val Asn Gly Ser Ala Leu Ser Asn Asp Ile Leu Arg Gln 385 390 395 400 Pro Thr Leu Val His Arg Pro Ala Lys Gly His Phe Asp Lys Pro Pro 405 410 415 Val Pro Val Ala Gln Ala Asn Leu Ala Val Arg Lys Asn Ile Asn Asp 420 425 430 Leu Thr Ala Glu Glu Thr Tyr Ser Leu Arg Lys Ala Met Glu Arg Phe 435 440 445 Gln Asn Asp Ser Val Asp Gly Tyr Gln Ala Thr Val Glu Phe His Ala 450 455 460 Leu Pro Ala Arg Cys Pro Arg Pro Asp Ala Lys Asp Arg Phe Ala Cys 465 470 475 480 Cys Val His Gly Met Ala Thr Phe Pro His Trp His Arg Leu Phe Val 485 490 495 Thr Gln Val Glu Asp Ala Leu Leu Arg Arg Gly Ser Thr Ile Gly Leu 500 505 510 Pro Tyr Trp Asp Trp Thr Met Pro Met Asp His Leu Pro Glu Leu Ala 515 520 525 Met Ser Glu Thr Tyr Pro His Pro Val Ser Ser Ala Ile Met Ala Asn 530 535 540 Pro Phe Tyr His Ala Glu Val Ala Phe Glu Asn Asn Val Thr His Arg 545 550 555 560 Asp Thr Asp Asn Arg Leu Phe Glu Lys Pro Ser Phe Gly His His Thr 565 570 575 His Leu Phe Asp Ala Met Val Tyr Ala Phe Glu Gln Glu Asp Phe Cys 580 585 590 Asp Phe Glu Val Gln Phe Glu Leu Val His Asn Asn Ile His Ala Trp 595 600 605 Val Gly Gly Ser Glu Lys Tyr Ser Met Ser Ser Leu His Tyr Thr Ala 610 615 620 Phe Asp Pro Ile Phe Tyr Leu His His Ser Asn Val Asp Arg Leu Trp 625 630 635 640 Ala Ile Trp Gln Ala Leu Gln Ile Arg Arg Asn Lys Pro Tyr Lys Ala 645 650 655 His Cys Ala Trp Ser Gln Glu Arg Glu Pro Leu Lys Pro Phe Ser Phe 660 665 670 Ser Ser Pro Leu Asn Asn Asn Asp Lys Thr Tyr Gln Asn Ser Val Pro 675 680 685 Thr Asn Val Tyr Asp Tyr Glu Gly Val Leu Asp Tyr Arg Tyr Asp Asp 690 695 700 Leu Gln Phe Gly Gly Met Thr Leu Ser Glu Leu Glu Lys Tyr Ile His 705 710 715 720 Lys Gln Lys Gln His Asp Arg Ser Phe Ala Gly Phe Phe Leu Ser His 725 730 735 Ile Gly Thr Ser Ala Asn Val Asp Val Leu Ile Ser Ala Lys Gly Gln 740 745 750 Glu Ala His Thr Ile Gly Ser Phe Ala Val Leu Gly Gly Ser Lys Glu 755 760 765 Met Lys Trp Ala Phe Asp Arg Met Tyr Lys Tyr Glu Ile Thr Asp Ala 770 775 780 Ala Lys Lys Leu Asn Ile Asp Val Asn Asp Ala Thr Ile Ser Val Lys 785 790 795 800 Ile Thr Asp Val Asp Gly Thr Ser Leu Ser Pro Glu Leu Ile Pro Ser 805 810 815 Pro Ala Leu Met Phe Glu Pro Gly His Ile Val Ala Lys Asp Phe Gly 820 825 830 His Ser Lys Lys Ile Arg Lys Asn Val His Ser Leu Thr Ala Glu Glu 835 840 845 Gln Asn Ser Leu Arg Arg Ala Met Ala Asp Leu Gln Asp Asp Lys Thr 850 855 860 Arg Gly Gly Phe Gln Gln Ile Ala Ala Phe His Gly Glu Pro Lys Trp 865 870 875 880 Cys Pro Arg Pro Glu Ala Glu Lys Lys Phe Ala Cys Cys Val His Gly 885 890 895 Met Ala Val Phe Pro His Trp His Arg Leu Leu Thr Val Gln Gly Glu 900 905 910 Asn Ala Leu Arg Lys His Gly Phe Thr Gly Gly Leu Pro Tyr Trp Asp 915 920 925 Trp Thr Arg Pro Met Ser Ala Leu Pro His Phe Val Ala Asp Pro Thr 930 935 940 Tyr Asn Asp Ser Val Ser Ser Leu Glu Glu Asp Asn Pro Trp Tyr His 945 950 955 960 Gly His Ile Asp Ser Val Gly His Asp Thr Thr Arg Ala Val Arg Asp 965 970 975 Asp Leu Tyr Gln Ser Pro Gly Phe Gly His Tyr Thr Asp Ile Ala Lys 980 985 990 Gln Val Leu Leu Ala Phe Glu Gln Asp Asp Phe Cys Asp Phe Glu Val 995 1000 1005 Gln Phe Glu Ile Ala His Asn Phe Ile His Ala Leu Val Gly Gly 1010 1015 1020 Asn Glu Pro Tyr Ser Met Ser Ser Leu Arg Tyr Thr Thr Tyr Asp 1025 1030 1035 Pro Ile Phe Phe Leu His His Ser Asn Thr Asp Arg Leu Trp Ala 1040 1045 1050 Ile Trp Gln Ala Leu Gln Lys Tyr Arg Gly Lys Pro Tyr Asn Thr 1055 1060 1065 Ala Asn Cys Ala Ile Ala Ser Met Arg Lys Pro Leu Gln Pro Phe 1070 1075 1080 Gly Leu Asp Ser Val Ile Asn Pro Asp Asp Glu Thr Arg Glu His 1085 1090 1095 Ser Val Pro Phe Arg Val Phe Asp Tyr Lys Asn Asn Phe Asp Tyr 1100 1105 1110 Glu Tyr Glu Ser Leu Ala Phe Asn Gly Leu Ser Ile Ala Gln Leu 1115 1120 1125 Asp Arg Glu Leu Gln Arg Arg Lys Ser His Asp Arg Val Phe Ala 1130 1135 1140 Gly Phe Leu Leu His Glu Ile Gly Gln Ser Ala Leu Val Lys Phe 1145 1150 1155 Tyr Val Cys Lys His Asn Val Ser Asp Cys Asp His Tyr Ala Gly 1160 1165 1170 Glu Phe Tyr Ile Leu Gly Asp Glu Ala Glu Met Pro Trp Arg Tyr 1175 1180 1185 Asp Arg Val Tyr Lys Tyr Glu Ile Thr Gln Gln Leu His Asp Leu 1190 1195 1200 Asp Leu His Val Gly Asp Asn Phe Phe Leu Lys Tyr Glu Ala Phe 1205 1210 1215 Asp Leu Asn Gly Gly Ser Leu Gly Gly Ser Ile Phe Ser Gln Pro 1220 1225 1230 Ser Val Ile Phe Glu Pro Thr Ala Gly Ser His Gln Ala Asp Glu 1235 1240 1245 Tyr Arg Glu Ala Val Thr Ser Ala Ser His Ile Arg Lys Asn Ile 1250 1255 1260 Arg Asp Leu Ser Glu Gly Glu Ile Glu Ser Ile Arg Ser Ala Phe 1265 1270 1275 Leu Gln Ile Gln Lys Glu Gly Ile Tyr Glu Asn Ile Ala Lys Phe 1280 1285 1290 His Gly Lys Pro Gly Leu Cys Glu Tyr Asp Gly His Pro Val Ala 1295 1300 1305 Cys Cys Val His Gly Met Pro Thr Phe Pro His Trp His Arg Leu 1310 1315 1320 Tyr Val Leu Gln Val Glu Asn Ala Leu Leu Glu Arg Gly Ser Ala 1325 1330 1335 Val Ala Val Pro Tyr Trp Asp Trp Thr Glu Lys Ala Asp Ser Leu 1340 1345 1350 Pro Ser Leu Ile Asn Asp Ala Thr Tyr Phe Asn Ser Arg Ser Gln 1355 1360 1365 Thr Phe Asp Pro Asn Pro Phe Phe Arg Gly His Ile Ala Phe Glu 1370 1375 1380 Asn Ala Val Thr Ser Arg Asp Pro Gln Pro Glu Leu Trp Asp Asn 1385 1390 1395 Lys Asp Phe Tyr Glu Asn Val Met Leu Ala Leu Glu Gln Asp Asn 1400 1405 1410 Phe Cys Asp Phe Glu Ile Gln Leu Glu Leu Ile His Asn Ala Leu 1415 1420 1425 His Ser Arg Leu Gly Gly Arg Ala Lys Tyr Ser Leu Ser Ser Leu 1430 1435 1440 Asp Tyr Thr Ala Phe Asp Pro Val Phe Phe Leu His His Ala Asn 1445 1450 1455 Val Asp Arg Ile Trp Ala Ile Trp Gln Asp Leu Gln Arg Tyr Arg 1460 1465 1470 Lys Lys Pro Tyr Asn Glu Ala Asp Cys Ala Val Asn Glu Met Arg 1475 1480 1485 Lys Pro Leu Gln Pro Phe Asn Asn Pro Glu Leu Asn Ser Asp Ser 1490 1495 1500 Met Thr Leu Lys His Asn Leu Pro Gln Asp Ser Phe Asp Tyr Gln 1505 1510 1515 Asn Arg Phe Arg Tyr Gln Tyr Asp Asn Leu Gln Phe Asn His Phe 1520 1525 1530 Ser Ile Gln Lys Leu Asp Gln Thr Ile Gln Ala Arg Lys Gln His 1535 1540 1545 Asp Arg Val Phe Ala Gly Phe Ile Leu His Asn Ile Gly Thr Ser 1550 1555 1560 Val Val Val Asp Ile Tyr Ile Cys Val Glu Gln Gly Gly Glu Gln 1565 1570 1575 Asn Cys Lys Thr Lys Ala Gly Ser Phe Thr Ile Leu Gly Gly Glu 1580 1585 1590 Thr Glu Met Pro Phe His Phe Asp Arg Leu Tyr Lys Phe Asp Ile 1595 1600 1605 Thr Ser Ala Leu His Lys Leu Gly Val Pro Leu Asp Gly His Gly 1610 1615 1620 Phe Asp Ile Lys Val Asp Val Arg Ala Val Asn Gly Ser His Leu 1625 1630 1635 Asp Pro His Ile Leu Asn Glu Pro Ser Leu Leu Phe Val Pro Gly 1640 1645 1650 Glu His Ser His Gln Thr Ala Asp Gly Leu Ser Gln His Asn Leu 1655 1660 1665 Val Arg Lys Glu Val Ser Ser Leu Thr Thr Leu Glu Lys His Phe 1670 1675 1680 Leu Arg Lys Ala Leu Lys Asn Met Gln Ala Asp Asp Ser Pro Asp 1685 1690 1695 Gly Tyr Gln Ala Ile Ala Ser Phe His Ala Leu Pro Pro Leu Cys 1700 1705 1710 Pro Ser Pro Ser Ala Ala His Arg His Ala Cys Cys Leu His Gly 1715 1720 1725 Met Ala Thr Phe Pro Gln Trp His Arg Leu Tyr Thr Val Gln Phe 1730 1735 1740 Glu Asp Ser Leu Lys Arg His Gly Ser Ile Val Gly Leu Pro Tyr 1745 1750 1755 Trp Asp Trp Leu Lys Pro Gln Ser Ala Leu Pro Asp Leu Val Thr 1760 1765 1770 Gln Glu Thr Tyr Glu His Leu Phe Ser His Lys Thr Phe Pro Asn 1775 1780 1785 Pro Phe Leu Lys Ala Asn Ile Glu Phe Glu Gly Glu Gly Val Thr 1790 1795 1800 Thr Glu Arg Asp Val Asp Ala Glu His Leu Phe Ala Lys Gly Asn 1805 1810 1815 Leu Val Tyr Asn Asn Trp Phe Tyr Asn Gln Ala Leu Tyr Ala Leu 1820 1825 1830 Glu Gln Glu Asn Tyr Cys Asp Phe Glu Ile Gln Phe Glu Ile Leu 1835 1840 1845 His Asn Gly Ile His Ser Trp Val Gly Gly Ser Lys Thr His Ser 1850 1855 1860 Ile Gly His Leu His Tyr Ala Ser Tyr Asp Pro Leu Phe Tyr Ile 1865 1870 1875 His His Ser Gln Thr Asp Arg Ile Trp Ala Ile Trp Gln Ala Phe 1880 1885 1890 Gln Glu His Arg Gly Leu Ser Gly Lys Glu Ala His Cys Ala Leu 1895 1900 1905 Glu Gln Met Lys Asp Pro Leu Lys Pro Phe Ser Phe Gly Ser Pro 1910 1915 1920 Tyr Asn Leu Asn Lys Arg Thr Gln Glu Phe Ser Lys Pro Glu Asp 1925 1930 1935 Thr Phe Asp Tyr His Arg Phe Gly Tyr Glu Tyr Asp Ser Leu Glu 1940 1945 1950 Phe Val Gly Met Ser Val Ser Ser Leu His Asn Tyr Ile Lys Gln 1955 1960 1965 Gln Gln Glu Ala Asp Arg Val Phe Ala Gly Phe Leu Leu Lys Gly 1970 1975 1980 Phe Gly Gln Ser Ala Ser Val Ser Phe Asp Ile Cys Arg Pro Asp 1985 1990 1995 Gln Ser Cys Gln Glu Ala Gly Tyr Phe Ser Val Leu Gly Gly Ser 2000 2005 2010 Ser Glu Met Pro Trp Gln Phe Asp Arg Leu Tyr Lys Tyr Asp Ile 2015 2020 2025 Thr Lys Thr Leu Lys Asp Met Lys Leu Arg Tyr Asp Asp Ala Phe 2030 2035 2040 Thr Ile Lys Val His Ile Lys Asp Ile Ala Gly Ala Glu Leu Asp 2045 2050 2055 Ser Asp Leu Ile Pro Thr Pro Ser Val Leu Phe Glu Glu Gly Lys 2060 2065 2070 His Gly Ile Asn Val Arg His Val Gly Arg Asn Arg Ile Arg Met 2075 2080 2085 Glu Leu Ser Glu Leu Thr Glu Arg Asp Leu Ala Ser Leu Lys Ser 2090 2095 2100 Ala Met Arg Ser Leu Gln Ala Asp Asp Gly Val Asn Gly Tyr Gln 2105 2110 2115 Ala Ile Ala Ser Phe His Gly Leu Pro Ala Ser Cys His Asp Asp 2120 2125 2130 Glu Gly His Glu Ile Ala Cys Cys Ile His Gly Met Pro Val Phe 2135 2140 2145 Pro His Trp His Arg Leu Tyr Thr Leu Gln Met Asp Met Ala Leu 2150 2155 2160 Leu Ser His Gly Ser Ala Val Ala Ile Pro Tyr Trp Asp Trp Thr 2165 2170 2175 Lys Pro Ile Ser Lys Leu Pro Asp Leu Phe Thr Ser Pro Glu Tyr 2180 2185 2190 Tyr Asp Pro Trp Arg Asp Ala Val Val Asn Asn Pro Phe Ala Lys 2195 2200 2205 Gly Tyr Ile Lys Ser Glu Asp Ala Tyr Thr Val Arg Asp Pro Gln 2210 2215 2220 Asp Ile Leu Tyr His Leu Gln Asp Glu Thr Gly Thr Ser Val Leu 2225 2230 2235 Leu Asp Gln Thr Leu Leu Ala Leu Glu Gln Thr Asp Phe Cys Asp 2240 2245 2250 Phe Glu Val Gln Phe Glu Val Val His Asn Ala Ile His Tyr Leu 2255 2260 2265 Val Gly Gly Arg Gln Val Tyr Ala Leu Ser Ser Leu His Tyr Ala 2270 2275 2280 Ser Tyr Asp Pro Ala Phe Phe Ile His His Ser Phe Val Asp Lys 2285 2290 2295 Ile Trp Ala Val Trp Gln Ala Leu Gln Lys Lys Arg Lys Arg Pro 2300 2305 2310 Tyr His Lys Ala Asp Cys Ala Leu Asn Met Met Thr Lys Pro Met 2315 2320 2325 Arg Pro Phe Ala His Asp Phe Asn His Asn Gly Phe Thr Lys Met 2330 2335 2340 His Ala Val Pro Asn Thr Leu Phe Asp Phe Gln Asp Leu Phe Tyr 2345 2350 2355 Thr Tyr Asp Asn Leu Glu Ile Ala Gly Met Asn Val Asn Gln Leu 2360 2365 2370 Glu Ala Glu Ile Asn Arg Arg Lys Ser Gln Thr Arg Val Phe Ala 2375 2380 2385 Gly Phe Leu Leu His Gly Ile Gly Arg Ser Ala Asp Val Arg Phe 2390 2395 2400 Trp Ile Cys Lys Thr Ala Asp Asp Cys His Ala Ser Gly Met Ile 2405 2410 2415 Phe Ile Leu Gly Gly Ser Lys Glu Met His Trp Ala Tyr Asp Arg 2420 2425 2430 Asn Phe Lys Tyr Asp Ile Thr Gln Ala Leu Lys Ala Gln Ser Ile 2435 2440 2445 His Pro Glu Asp Val Phe Asp Thr Asp Ala Pro Phe Phe Ile Lys 2450 2455 2460 Val Glu Ile His Gly Val Asn Lys Thr Ala Leu Pro Ser Ser Ala 2465 2470 2475 Ile Pro Ala Pro Thr Ile Ile Tyr Ser Ala Gly Glu Gly His Thr 2480 2485 2490 Asp Asp His Gly Ser Asp His Ile Ala Gly Ser Gly Val Arg Lys 2495 2500 2505 Asp Val Thr Ser Leu Thr Ala Ser Glu Ile Glu Asn Leu Arg His 2510 2515 2520 Ala Leu Gln Ser Val Met Asp Asp Asp Gly Pro Asn Gly Phe Gln 2525 2530 2535 Ala Ile Ala Ala Tyr His Gly Ser Pro Pro Met Cys His Met Pro 2540 2545 2550 Asp Gly Arg Asp Val Ala Cys Cys Thr His Gly Met Ala Ser Phe 2555 2560 2565 Pro His Trp His Arg Leu Phe Val Lys Gln Met Glu Asp Ala Leu 2570 2575 2580 Ala Ala His Gly Ala His Ile Gly Ile Pro Tyr Trp Asp Trp Thr 2585 2590 2595 Ser Ala Phe Ser His Leu Pro Ala Leu Val Thr Asp His Glu His 2600 2605 2610 Asn Pro Phe His His Gly His Ile Ala His Arg Asn Val Asp Thr 2615 2620 2625 Ser Arg Ser Pro Arg Asp Met Leu Phe Asn Asp Pro Glu His Gly 2630 2635 2640 Ser Glu Ser Phe Phe Tyr Arg Gln Val Leu Leu Ala Leu Glu Gln 2645 2650 2655 Thr Asp Phe Cys Gln Phe Glu Val Gln Phe Glu Ile Thr His Asn 2660 2665 2670 Ala Ile His Ser Trp Thr Gly Gly His Thr Pro Tyr Gly Met Ser 2675 2680 2685 Ser Leu Glu Tyr Thr Ala Tyr Asp Pro Leu Phe Tyr Leu His His 2690 2695 2700 Ser Asn Thr Asp Arg Ile Trp Ala Ile Trp Gln Ala Leu Gln Lys 2705 2710 2715 Tyr Arg Gly Phe Gln Tyr Asn Ala Ala His Cys Asp Ile Gln Val 2720 2725 2730 Leu Lys Gln Pro Leu Lys Pro Phe Ser Glu Ser Arg Asn Pro Asn 2735 2740 2745 Pro Val Thr Arg Ala Asn Ser Arg Ala Val Asp Ser Phe Asp Tyr 2750 2755 2760 Glu Arg Leu Asn Tyr Gln Tyr Asp Thr Leu Thr Phe His Gly His 2765 2770 2775 Ser Ile Ser Glu Leu Asp Ala Met Leu Gln Glu Arg Lys Lys Glu 2780 2785 2790 Glu Arg Thr Phe Ala Ala Phe Leu Leu His Gly Phe Gly Ala Ser 2795 2800 2805 Ala Asp Val Ser Phe Asp Val Cys Thr Pro Asp Gly His Cys Ala 2810 2815 2820 Phe Ala Gly Thr Phe Ala Val Leu Gly Gly Glu Leu Glu Met Pro 2825 2830 2835 Trp Ser Phe Glu Arg Leu Phe Arg Tyr Asp Ile Thr Lys Val Leu 2840 2845 2850 Lys Gln Met Asn Leu Tyr Tyr Asp Ser Glu Phe His Phe Ala Leu 2855 2860 2865 Lys Val Val Gly Thr Asp Gly Thr Glu Leu Pro Ser Asp Ser Ile 2870 2875 2880 Lys Ser Pro Thr Ile Glu His His Gln Ala Asp His Gln Gly His 2885 2890 2895 Gly Gly Glu Gly Gly Gln His Pro Glu His Thr Gly Phe Val Arg 2900 2905 2910 Lys Asp Ile Asp Ser Leu Ser Leu Asp Glu Ala Asn Asp Leu Lys 2915 2920 2925 Asn Ala Leu Tyr Lys Leu Gln Asn Asp Gln Ser Leu Ser Gly Tyr 2930 2935 2940 Glu Ser Ile Ala Gly Phe His Gly Tyr Pro Tyr Leu Cys Pro Asp 2945 2950 2955 Asn Gly Asp Asp Lys Tyr Ala Cys Cys Val His Gly Met Pro Val 2960 2965 2970 Phe Pro His Trp His Arg Leu Leu Thr Ile Gln Phe Glu Arg Ala 2975 2980 2985 Leu Lys Glu His Gly Ser His Leu Gly Val Pro Tyr Trp Asp Trp 2990 2995 3000 Thr Lys Ser Met Thr Ala Leu Pro Ala Phe Phe Ala Asp Ser Ser 3005 3010 3015 Asn Asn Asn Pro Phe Tyr Lys Tyr His Ile Met Lys Ala Gly His 3020 3025 3030 Asp Thr Thr Arg Gln Pro Asn Asp Leu Leu Phe Asn Gln Pro Gln 3035 3040 3045 Val His Gly Tyr Asp Tyr Leu Tyr Tyr Leu Ala Leu Ser Thr Leu 3050 3055 3060 Glu Glu Asp Asn Tyr Cys Asp Phe Glu Val Gln Tyr Glu Asn Phe 3065 3070 3075 His Asn Ala Val His Ile Trp Leu Gly Gly Thr Glu Thr Tyr Ser 3080 3085 3090 Met Ala Thr Leu Glu Tyr Ser Ala Phe Asp Pro Val Phe Met Ile 3095 3100 3105 Leu His Ser Gly Leu Asp Arg Leu Trp Ile Ile Trp Gln Lys Leu 3110 3115 3120 Gln Lys Leu Arg Lys Lys Pro Tyr Asn Ala Ala Lys Cys Ala Gly 3125 3130 3135 His Met Ile Asp Glu Pro Leu His Pro Phe Asp Tyr Glu Ser Val 3140 3145 3150 Asn Gln Asp Gly Phe Thr Arg Ala Asn Ala Lys Pro Thr Thr Val 3155 3160 3165 Phe Asp Ser His Arg Phe Asp Tyr His Tyr Asp Asn Leu Asp Val 3170 3175 3180 Arg Gly Lys Ser Ile Gln Glu Ile Asn Ala Ile Ile His Asp Leu 3185 3190 3195 Arg Thr Gln Pro Arg Ile Phe Ala Gly Phe Val Leu Ser Gly Ile 3200 3205 3210 Tyr Thr Ser Ala Asn Val Lys Ile Tyr Leu Ile Arg Glu Gly His 3215 3220 3225 Asp Asp Glu Tyr Val Gly Thr Phe Ala Val Leu Gly Gly Ala Lys 3230 3235 3240 Glu Met Pro Trp Ala Tyr Glu Arg Ile Phe Lys Tyr Asp Ile Thr 3245 3250 3255 Asp Val Ala His Lys Leu Lys Met His His Asp Asp Thr Phe Asn 3260 3265 3270 Phe Arg Leu Glu Ile Gln Ser Tyr Ser Gly Glu Thr Leu Thr Ser 3275 3280 3285 His Leu Pro Glu Pro Leu Ile Ile His Arg Pro Ala Asn Lys Asp 3290 3295 3300 Tyr Asp Val Leu Ile Ile Pro Leu Gly Ser Gly Arg Lys Leu Pro 3305 3310 3315 Pro Lys Val Ile Val Lys Arg Gly Thr Glu Ile Glu Phe His Pro 3320 3325 3330 Val Asp Asp Thr Ile Asn Arg Pro Val Val Asp Leu Gly Ser Tyr 3335 3340 3345 Thr Ala Leu Tyr Asn Cys Val Val Pro Pro Phe Thr Tyr His Gly 3350 3355 3360 Tyr Glu Leu Asp His Ala Tyr Ser Leu Arg Asp Gly His Tyr Tyr 3365 3370 3375 Ile Ala Gly Pro Thr Lys Asp Leu Cys Ala Ser Gly Asn Val Arg 3380 3385 3390 Ile His Ile His Ile Glu Asp Glu 3395 3400 4 57 DNA Megathura crenulata 4 atgtggacca tcttggctct cctcacggcg accctcttgt ttgagggagc cttctcg 57 5 57 DNA Megathura crenulata SEQ ID NO5 is the complementary sequence of the coding sequence of SEQ ID NO4 5 cgagaaggct ccctcaaaca agagggtcgc cgtgaggaga gccaagatgg tccacat 57 6 19 PRT Megathura crenulata SEQ ID NO6 is the deduced amino acid sequence of SEQ ID NO4 6 Met Trp Thr Ile Leu Ala Leu Leu Thr Ala Thr Leu Leu Phe Glu Gly 1 5 10 15 Ala Phe Ser 7 10263 DNA Megathura crenulata 7 atgtggacca tcttggctct cctcacggcg accctcttgt ttgagggagc cttctcggta 60 gacacagtgg tgagaaagaa tgtggacagt ttgagcagtg acgaggttct agccctcgag 120 aaggctctcg atgatctcca gcaagacgac agcaaccagg gttatcaggc catagcaggc 180 taccatggtg ttccaacaat gtgtgtggac aaacatgaaa aaaatgtggc ctgttgtctt 240 cacggtatgc catcctttcc cctttggcat cgtctctacg ttgtccagtt ggaaagagcg 300 ctgataagga agaaggcaac catcagcatc ccctactggg attggactag tgagctcacc 360 catcttccag aacttgtgtc tcatccactg tttgtcgaca ctgagggggg caaggctcat 420 gacaattctt ggtaccgagc tgacatcact ttcctcaaca agaagacaag tcgtgctgta 480 gatgaccgtc tttttgaaaa agtacagcca ggacaccaca ctcgcctcat ggagggaatc 540 cttgacgctc tggaacagga cgagttttgc aaatttgaga tccagtttga gctggctcac 600 aacgccatac attatctagt tgggggcaga cacacatatt ccatgtctca cctcgagtac 660 acgtcatacg atcctttgtt ctttttacat cacttcaaca ctgaccgtat ctttgctata 720 tggcaacgac ttcaacagct cagaggcaag gatcccaaca gtgctgactg tgcccataac 780 cttattcaca ctccaatgga accgttcgac agagacacta atccactgga tctgacaaga 840 gagcacgcca aacctgctga tagctttgac tacgggcgcc tcggctacca gtatgacgac 900 ctctccttga atggcatgag tccagaggaa ctcaacgtat atctaggaga acgagcagct 960 aaggagcgta catttgccag cttcatatta tctggttttg gtggctctgc caatgttgta 1020 gtctatgtct gccgacctgc tcatgatgaa atcgctgatg atcaatgtat caaagcagga 1080 gatttcttcc ttcttggcgg accaactgaa atgaagtggg gattctacag ggcataccac 1140 tttgatgtta ctgacagcgt tgcttctata gatgatgatg gtcatgggca ttattacgtg 1200 aagtccgagt tattcagcgt caacggaagc gctttgtcaa atgacattct tcgacaaccc 1260 accttagtgc accgaccagc taaaggtcat tttgataagc ctccagttcc agttgctcaa 1320 gctaacctgg ctgtcagaaa gaatatcaac gatctaactg cagaggaaac atacagcttg 1380 aggaaagcta tggaaagatt ccagaatgac tcagttgacg ggtaccaagc tacagtagaa 1440 ttccatgccc ttccagcaag atgtccacgc ccagatgcta aagatagatt tgcatgctgc 1500 gttcatggca tggcaacctt cccccactgg catagactct ttgtcactca agttgaagat 1560 gctctacttc gtcgcgggtc tactatcggt ctaccttact gggactggac catgccaatg 1620 gatcatctgc cagaactggc tatgagtgag acctatcccc atccagtttc tagcgcaatc 1680 atggcaaatc cattttacca tgccgaagtg gcttttgaaa ataacgtcac acacagagat 1740 acagacaata gactctttga gaaaccgtct tttggacacc acacgcatct gtttgacgct 1800 atggtctatg cttttgaaca agaagacttt tgcgactttg aagttcagtt tgaactcgtt 1860 cacaataaca tacatgcgtg ggttggagga agtgaaaagt actccatgtc ttctcttcac 1920 tacactgctt ttgatcctat tttctacctc catcactcaa atgttgatcg tctctgggcc 1980 atttggcaag ctcttcaaat caggaggaac aaaccttaca aggcccattg tgcctggtca 2040 caggaaaggg aaccattaaa gcctttttca ttcagttcac ctctcaacaa caacgataaa 2100 acctaccaaa actctgttcc aactaatgtt tacgactatg agggagtttt ggactatcga 2160 tatgatgacc tccagtttgg tggcatgacc ctgtcagaac ttgaaaaata tatacacaag 2220 cagaaacagc atgacagaag ttttgcagga ttcttccttt ctcatattgg aacatcagca 2280 aacgtagacg tccttatcag tgcaaagggc caagaggccc acacgattgg aagctttgca 2340 gtgcttggtg gatctaagga aatgaaatgg gctttcgaca gaatgtacaa atatgaaatt 2400 actgatgctg ccaaaaagct gaacattgat gtgaatgatg ccactatttc tgttaagatc 2460 actgatgttg atggaacttc cttgtcacca gagctaattc cgtctcctgc tctcatgttt 2520 gaacctgggc atattgttgc caaagacttt ggccacagca aaaaaatcag gaagaacgtt 2580 cattccctga cagctgaaga acaaaattcg ttgaggcgag ctatggcaga tctccaagat 2640 gacaaaacac gaggtggatt ccaacagatt gcagcatttc acggagaacc aaaatggtgc 2700 ccacgccctg aagcggagaa gaaatttgca tgctgcgttc acggaatggc tgttttccct 2760 cactggcaca gattgctgac agttcaagga gaaaatgctc tgaggaaaca tggatttact 2820 ggtggattgc cctattggga ctggactcga ccaatgagcg cccttccaca ttttgttgct 2880 gatcctactt acaatgattc tgtttccagc ctcgaagaag ataacccatg gtatcatggt 2940 cacatagatt ctgttgggca tgatactaca agagctgtgc gtgatgatct ttatcaatct 3000 cctggtttcg gtcactacac agatattgca aaacaagtcc ttctggcctt tgagcaggac 3060 gatttctgtg attttgaggt acaatttgaa attgcccata atttcataca tgctctagtt 3120 ggtggtaacg aaccatacag tatgtcatct ttgaggtata ctacatacga tccaatcttc 3180 ttcttgcacc actccaatac agaccgactt tgggccattt ggcaagcttt gcaaaaatac 3240 cgggggaaac catacaacac tgcaaactgt gccattgcat ccatgagaaa accacttcag 3300 ccatttggtc ttgatagtgt cataaatcca gatgacgaaa ctcgtgaaca ttcggttcct 3360 ttccgagtct tcgactacaa gaacaacttc gactatgagt atgagagcct ggcatttaat 3420 ggtctgtcta ttgcccaact ggaccgagag ttgcagagaa gaaagtcaca tgacagagtc 3480 tttgcaggat tccttcttca tgaaattgga cagtctgcac tcgtgaaatt ctacgtttgc 3540 aaacacaatg tatctgactg tgaccattat gctggagaat tctacatttt gggagatgaa 3600 gctgagatgc cttggaggta tgaccgtgtg tacaagtacg agataacaca gcagctgcac 3660 gatttagatc tacatgttgg agataatttc tttcttaaat atgaagcctt tgatctgaat 3720 ggcggaagtc ttggtggaag tatcttttct cagccttcgg tgattttcga gccaactgca 3780 ggttcacacc aggctgatga atatcgtgag gcagtaacaa gcgctagcca cataagaaaa 3840 aatatccggg acctctcaga gggagaaatt gagagcatca gatctgcttt cctccaaatt 3900 caaaaagagg gtatatatga aaacattgca aagttccatg gaaaaccagg actttgtgaa 3960 tatgatggac atcctgttgc ttgttgtgtc catggcatgc ccacctttcc ccactggcac 4020 agactgtacg ttcttcaggt ggagaatgcg ctcttagaac gagggtctgc agttgctgtt 4080 ccttactggg actggaccga gaaagctgac tctctgccat cattaatcaa tgatgcaact 4140 tatttcaatt cacgatccca gacctttgat cctaatcctt tcttcagggg acatatcgcc 4200 ttcgagaatg ctgtgacgtc cagagatcct cagccagaac tatgggacaa taaggacttc 4260 tacgagaatg tcatgctggc tcttgagcaa gacaacttct gtgactttga gattcagctt 4320 gagctgatac acaacgccct tcattctaga cttggaggaa gggctaaata ctccctttcg 4380 tctcttgatt ataccgcatt tgatcctgta tttttccttc accatgcaaa cgttgacaga 4440 atctgggcca tctggcagga cttgcagaga tatagaaaga aaccatacaa tgaggctgac 4500 tgcgcagtca acgagatgcg taaacctctt caaccattta ataacccaga acttaacagt 4560 gattccatga cgcttaaaca caacctccca caagacagtt ttgattatca aaaccgcttc 4620 aggtaccaat atgataacct tcaatttaac cacttcagca tacaaaagct agaccaaact 4680 attcaggcta gaaaacaaca cgacagagtt tttgctggct ttattcttca caacattggg 4740 acatctgttg ttgtagatat ttatatttgc gttgaacaag gaggagaaca aaactgcaag 4800 acaaaggcgg gttccttcac gattctgggg ggagaaacag aaatgccatt ccactttgac 4860 cgcttgtaca aatttgacat aacgtctgct ctgcataaac ttggtgttcc cttggacgga 4920 catggattcg acatcaaagt tgacgtcaga gctgtcaatg gatcgcatct tgatccacac 4980 atcctcaacg aaccgagtct gctttttgtt cctggtgaac attcacatca aactgcagat 5040 gggctttcac aacataatct tgtgcgaaaa gaagtaagct ctcttacaac actggagaaa 5100 cattttttga ggaaagctct caagaacatg caagcagatg attctccaga cggatatcaa 5160 gctattgctt ctttccacgc tttgcctcct ctttgtccaa gtccatctgc tgcacataga 5220 cacgcttgtt gcctccatgg tatggctacc ttccctcagt ggcacagact ctacacagtt 5280 cagttcgaag attctttgaa acgacatggt tctattgtcg gacttccata ttgggattgg 5340 ctgaaaccgc agtctgcact ccctgatttg gtgacacagg agacatacga gcacctgttt 5400 tcacacaaaa ccttcccaaa tccgttcctc aaggcaaata tagaatttga gggagaggga 5460 gtaacaacag agagggatgt tgatgctgaa cacctctttg caaaaggaaa tctggtttac 5520 aacaactggt tttacaatca ggcactatat gcactagaac aagaaaatta ctgtgacttt 5580 gaaatacagt tcgaaatttt gcataatgga attcattcat gggttggagg atcaaagacc 5640 cattcaatag gtcatcttca ttacgcatca tacgatccac tgttctatat ccaccattcg 5700 cagacagatc gcatttgggc tatctggcaa gctttccagg agcacagagg tctttcaggg 5760 aaggaagcac actgcgccct ggagcaaatg aaagaccctc tcaaaccttt cagctttgga 5820 agtccctata atttgaacaa acgcactcaa gagttctcca agcctgaaga cacatttgat 5880 tatcaccgat tcgggtatga gtatgattcc ctcgaatttg ttggcatgtc tgtttcaagt 5940 ttacataact atataaaaca acaacaggaa gctgatagag tcttcgcagg attccttctt 6000 aaaggatttg gacaatcagc atccgtatcg tttgatatct gcagaccaga ccagagttgc 6060 caagaagctg gatacttctc agttctcggt ggaagttcag aaatgccgtg gcagtttgac 6120 aggctttaca agtacgacat tacaaaaacg ttgaaagaca tgaaactgcg atacgatgac 6180 gcatttacca tcaaggttca cataaaggat atagctggag ctgagttgga cagcgatctg 6240 attccaactc cttctgttct ctttgaagaa ggaaagcatg ggatcaatgt acgtcacgtt 6300 ggtcgtaatc ggattcgtat ggaactatct gaactcaccg agagagatct cgccagcctg 6360 aaatctgcaa tgaggtctct acaagctgac gatggggtga acggttatca agccattgca 6420 tcattccacg gtctcccggc ttcttgtcat gatgatgagg gacatgagat tgcctgttgt 6480 atccacggaa tgccagtatt cccacactgg cacaggcttt acaccctgca aatggacatg 6540 gctctgttat ctcacggatc tgctgttgct attccatact gggactggac caaacctatc 6600 agcaaactgc ctgatctctt caccagccct gaatattacg atccttggag ggatgcagtt 6660 gtcaataatc catttgctaa aggctacatt aaatccgagg acgcttacac ggttagggat 6720 cctcaggaca ttttgtacca cttgcaggac gaaacgggaa catctgtttt gttagatcaa 6780 actcttttag ccttagaaca gacagatttc tgtgattttg aggttcaatt tgaggtcgtc 6840 cataatgcta ttcactactt ggtgggtggt cgacaagttt atgctctttc ttctctacac 6900 tatgcttcat atgacccagc cttctttatt catcactcct ttgttgacaa aatatgggca 6960 gtctggcaag ctctgcaaaa gaagagaaag cgtccctatc ataaagcgga ttgtgctctt 7020 aacatgatga ccaaaccaat gcgaccattt gcacacgatt tcaatcacaa tggattcaca 7080 aaaatgcacg cagtccccaa cactctattt gactttcagg accttttcta cacgtatgac 7140 aacttagaaa ttgctggcat gaatgttaat cagttggaag cggaaatcaa ccggcgaaaa 7200 agccaaacaa gagtctttgc cgggttcctt ctacatggca ttggaagatc agctgatgta 7260 cgattttgga tttgcaagac agctgacgac tgccacgcat ctggcatgat ctttatctta 7320 ggaggttcta aagagatgca ctgggcctat gacaggaact ttaaatacga catcacccaa 7380 gctttgaagg ctcagtccat acaccctgaa gatgtgtttg acactgatgc tcctttcttc 7440 attaaagtgg agatccatgg tgtaaacaag actgctctcc catcttcagc tatcccagca 7500 cctactataa tctactcagc tggtgaaggt cataccgatg atcacggctc agatcatatt 7560 gctggcagtg gagtcaggaa agacgtgacg tctcttaccg catctgagat agagaacctg 7620 aggcatgctc tgcaaagcgt gatggatgat gatggaccca atggattcca ggcaattgct 7680 gcttatcacg gaagtcctcc catgtgtcac atgcctgatg gtagagacgt tgcatgttgt 7740 actcatggaa tggcatcttt ccctcactgg cacagactgt ttgtgaaaca gatggaggat 7800 gcactggctg cgcatggagc tcacattggc ataccatact gggattggac aagtgcgttt 7860 agtcatctgc ctgccctagt gactgaccac gagcacaatc ccttccacca cggacatatt 7920 gctcatcgga atgtggatac atctcgatct ccgagagaca tgctgttcaa tgaccctgaa 7980 cacgggtcag aatcattctt ctatagacag gttctcttgg ctctagaaca gacagacttc 8040 tgccaatttg aagttcagtt tgaaataaca cacaatgcaa tccactcttg gactggagga 8100 catactccat atggaatgtc atcactggaa tatacagcat atgatccact cttttatctc 8160 caccattcca acactgatcg tatctgggcc atctggcagg cactccagaa atacagaggt 8220 tttcaataca acgcagctca ttgcgatatc caggttctga aacaacctct taaaccattc 8280 agcgagtcca ggaatccaaa cccagtcacc agagccaatt ctagggcagt cgattcattt 8340 gattatgaga gactcaatta tcaatatgac acacttacct tccacggaca ttctatctca 8400 gaacttgatg ccatgcttca agagagaaag aaggaagaga gaacatttgc agccttcctg 8460 ttgcacggat ttggcgccag tgctgatgtt tcgtttgatg tctgcacacc tgatggtcat 8520 tgtgcctttg ctggaacctt cgcggtactt ggtggggagc tggagatgcc ctggtccttt 8580 gaaagattgt tccgttatga tatcacaaag gttctcaagc agatgaatct ttactatgat 8640 tctgagtttc atttcgcgtt gaaggttgtt ggcacagatg gaacagaact gccatcggat 8700 agtatcaaga gcccaacaat tgagcatcat caagcagatc accagggtca tggtggggag 8760 ggtggccaac atcctgaaca tactggattt gtaaggaagg acatagattc tctttccttg 8820 gatgaagcca atgatctgaa aaatgcactg tacaagctgc agaacgacca aagcctcagt 8880 ggatatgaat caatagctgg tttccatggc tatccctacc tctgtcctga caacggcgat 8940 gacaagtatg cctgctgtgt ccatggaatg cctgtatttc cacactggca cagacttctt 9000 actatccagt ttgagagagc actcaaagaa catggttcac atttgggtgt tccctattgg 9060 gattggacca agtcaatgac tgctcttcca gctttctttg ccgattcttc caataacaac 9120 cctttctaca aatatcacat tatgaaagct ggacatgata caacgagaca gccgaatgat 9180 cttctcttca atcaaccaca ggttcatgga tatgactatc tatactacct tgccctttca 9240 acactagaag aggataacta ctgtgacttt gaggttcagt atgaaaattt tcacaatgca 9300 gtacatatct ggctcggggg aactgaaacc tactccatgg ctactcttga atattcagcg 9360 tttgatccag tctttatgat cttgcattct ggtctcgaca gactctggat catttggcaa 9420 aagctgcaaa agctcagaaa gaaaccatat aatgcagcca aatgtgctgg ccacatgatt 9480 gatgaacctt tgcatccttt tgactacgag tctgttaacc aagatggctt cactcgagca 9540 aacgctaaac ctacaacagt gtttgacagt cacaggtttg attatcacta tgacaacctg 9600 gatgtacgag gaaagagtat tcaggaaata aacgccatca tccacgattt gaggacccaa 9660 ccaagaatct ttgctggttt tgttctttct ggaatataca cttcagctaa tgttaagata 9720 tacttaatca gagaaggcca tgacgatgaa tatgtaggaa catttgcagt tttaggagga 9780 gccaaggaga tgccatgggc ctatgagagg attttcaagt atgacattac agatgtcgcc 9840 cacaaactga aaatgcacca tgatgataca ttcaacttca gacttgaaat tcaatcttac 9900 agtggcgaaa cactaacatc acatcttcct gaacctctca taatccacag accagccaac 9960 aaggactatg atgtcttgat tattccatta ggaagcggac gcaagcttcc tcctaaagtc 10020 attgtcaaga gaggaacaga aattgagttt catccagtcg atgacacgat caacagacct 10080 gttgtagatc ttggcagcta cactgctctc tacaactgtg tggtgccacc cttcacatac 10140 cacggatatg agctcgatca tgcctactcc cttagggatg ggcactacta tattgcaggc 10200 ccaaccaaag acctctgcgc cagcggaaat gttagaatac atatccacat tgaagacgag 10260 taa 10263 8 10263 DNA Megathura crenulata SEQ ID NO8 is the complementary sequence of the coding sequence of SEQ ID NO7 8 ttactcgtct tcaatgtgga tatgtattct aacatttccg ctggcgcaga ggtctttggt 60 tgggcctgca atatagtagt gcccatccct aagggagtag gcatgatcga gctcatatcc 120 gtggtatgtg aagggtggca ccacacagtt gtagagagca gtgtagctgc caagatctac 180 aacaggtctg ttgatcgtgt catcgactgg atgaaactca atttctgttc ctctcttgac 240 aatgacttta ggaggaagct tgcgtccgct tcctaatgga ataatcaaga catcatagtc 300 cttgttggct ggtctgtgga ttatgagagg ttcaggaaga tgtgatgtta gtgtttcgcc 360 actgtaagat tgaatttcaa gtctgaagtt gaatgtatca tcatggtgca ttttcagttt 420 gtgggcgaca tctgtaatgt catacttgaa aatcctctca taggcccatg gcatctcctt 480 ggctcctcct aaaactgcaa atgttcctac atattcatcg tcatggcctt ctctgattaa 540 gtatatctta acattagctg aagtgtatat tccagaaaga acaaaaccag caaagattct 600 tggttgggtc ctcaaatcgt ggatgatggc gtttatttcc tgaatactct ttcctcgtac 660 atccaggttg tcatagtgat aatcaaacct gtgactgtca aacactgttg taggtttagc 720 gtttgctcga gtgaagccat cttggttaac agactcgtag tcaaaaggat gcaaaggttc 780 atcaatcatg tggccagcac atttggctgc attatatggt ttctttctga gcttttgcag 840 cttttgccaa atgatccaga gtctgtcgag accagaatgc aagatcataa agactggatc 900 aaacgctgaa tattcaagag tagccatgga gtaggtttca gttcccccga gccagatatg 960 tactgcattg tgaaaatttt catactgaac ctcaaagtca cagtagttat cctcttctag 1020 tgttgaaagg gcaaggtagt atagatagtc atatccatga acctgtggtt gattgaagag 1080 aagatcattc ggctgtctcg ttgtatcatg tccagctttc ataatgtgat atttgtagaa 1140 agggttgtta ttggaagaat cggcaaagaa agctggaaga gcagtcattg acttggtcca 1200 atcccaatag ggaacaccca aatgtgaacc atgttctttg agtgctctct caaactggat 1260 agtaagaagt ctgtgccagt gtggaaatac aggcattcca tggacacagc aggcatactt 1320 gtcatcgccg ttgtcaggac agaggtaggg atagccatgg aaaccagcta ttgattcata 1380 tccactgagg ctttggtcgt tctgcagctt gtacagtgca tttttcagat cattggcttc 1440 atccaaggaa agagaatcta tgtccttcct tacaaatcca gtatgttcag gatgttggcc 1500 accctcccca ccatgaccct ggtgatctgc ttgatgatgc tcaattgttg ggctcttgat 1560 actatccgat ggcagttctg ttccatctgt gccaacaacc ttcaacgcga aatgaaactc 1620 agaatcatag taaagattca tctgcttgag aacctttgtg atatcataac ggaacaatct 1680 ttcaaaggac cagggcatct ccagctcccc accaagtacc gcgaaggttc cagcaaaggc 1740 acaatgacca tcaggtgtgc agacatcaaa cgaaacatca gcactggcgc caaatccgtg 1800 caacaggaag gctgcaaatg ttctctcttc cttctttctc tcttgaagca tggcatcaag 1860 ttctgagata gaatgtccgt ggaaggtaag tgtgtcatat tgataattga gtctctcata 1920 atcaaatgaa tcgactgccc tagaattggc tctggtgact gggtttggat tcctggactc 1980 gctgaatggt ttaagaggtt gtttcagaac ctggatatcg caatgagctg cgttgtattg 2040 aaaacctctg tatttctgga gtgcctgcca gatggcccag atacgatcag tgttggaatg 2100 gtggagataa aagagtggat catatgctgt atattccagt gatgacattc catatggagt 2160 atgtcctcca gtccaagagt ggattgcatt gtgtgttatt tcaaactgaa cttcaaattg 2220 gcagaagtct gtctgttcta gagccaagag aacctgtcta tagaagaatg attctgaccc 2280 gtgttcaggg tcattgaaca gcatgtctct cggagatcga gatgtatcca cattccgatg 2340 agcaatatgt ccgtggtgga agggattgtg ctcgtggtca gtcactaggg caggcagatg 2400 actaaacgca cttgtccaat cccagtatgg tatgccaatg tgagctccat gcgcagccag 2460 tgcatcctcc atctgtttca caaacagtct gtgccagtga gggaaagatg ccattccatg 2520 agtacaacat gcaacgtctc taccatcagg catgtgacac atgggaggac ttccgtgata 2580 agcagcaatt gcctggaatc cattgggtcc atcatcatcc atcacgcttt gcagagcatg 2640 cctcaggttc tctatctcag atgcggtaag agacgtcacg tctttcctga ctccactgcc 2700 agcaatatga tctgagccgt gatcatcggt atgaccttca ccagctgagt agattatagt 2760 aggtgctggg atagctgaag atgggagagc agtcttgttt acaccatgga tctccacttt 2820 aatgaagaaa ggagcatcag tgtcaaacac atcttcaggg tgtatggact gagccttcaa 2880 agcttgggtg atgtcgtatt taaagttcct gtcataggcc cagtgcatct ctttagaacc 2940 tcctaagata aagatcatgc cagatgcgtg gcagtcgtca gctgtcttgc aaatccaaaa 3000 tcgtacatca gctgatcttc caatgccatg tagaaggaac ccggcaaaga ctcttgtttg 3060 gctttttcgc cggttgattt ccgcttccaa ctgattaaca ttcatgccag caatttctaa 3120 gttgtcatac gtgtagaaaa ggtcctgaaa gtcaaataga gtgttgggga ctgcgtgcat 3180 ttttgtgaat ccattgtgat tgaaatcgtg tgcaaatggt cgcattggtt tggtcatcat 3240 gttaagagca caatccgctt tatgataggg acgctttctc ttcttttgca gagcttgcca 3300 gactgcccat attttgtcaa caaaggagtg atgaataaag aaggctgggt catatgaagc 3360 atagtgtaga gaagaaagag cataaacttg tcgaccaccc accaagtagt gaatagcatt 3420 atggacgacc tcaaattgaa cctcaaaatc acagaaatct gtctgttcta aggctaaaag 3480 agtttgatct aacaaaacag atgttcccgt ttcgtcctgc aagtggtaca aaatgtcctg 3540 aggatcccta accgtgtaag cgtcctcgga tttaatgtag cctttagcaa atggattatt 3600 gacaactgca tccctccaag gatcgtaata ttcagggctg gtgaagagat caggcagttt 3660 gctgataggt ttggtccagt cccagtatgg aatagcaaca gcagatccgt gagataacag 3720 agccatgtcc atttgcaggg tgtaaagcct gtgccagtgt gggaatactg gcattccgtg 3780 gatacaacag gcaatctcat gtccctcatc atcatgacaa gaagccggga gaccgtggaa 3840 tgatgcaatg gcttgataac cgttcacccc atcgtcagct tgtagagacc tcattgcaga 3900 tttcaggctg gcgagatctc tctcggtgag ttcagatagt tccatacgaa tccgattacg 3960 accaacgtga cgtacattga tcccatgctt tccttcttca aagagaacag aaggagttgg 4020 aatcagatcg ctgtccaact cagctccagc tatatccttt atgtgaacct tgatggtaaa 4080 tgcgtcatcg tatcgcagtt tcatgtcttt caacgttttt gtaatgtcgt acttgtaaag 4140 cctgtcaaac tgccacggca tttctgaact tccaccgaga actgagaagt atccagcttc 4200 ttggcaactc tggtctggtc tgcagatatc aaacgatacg gatgctgatt gtccaaatcc 4260 tttaagaagg aatcctgcga agactctatc agcttcctgt tgttgtttta tatagttatg 4320 taaacttgaa acagacatgc caacaaattc gagggaatca tactcatacc cgaatcggtg 4380 ataatcaaat gtgtcttcag gcttggagaa ctcttgagtg cgtttgttca aattataggg 4440 acttccaaag ctgaaaggtt tgagagggtc tttcatttgc tccagggcgc agtgtgcttc 4500 cttccctgaa agacctctgt gctcctggaa agcttgccag atagcccaaa tgcgatctgt 4560 ctgcgaatgg tggatataga acagtggatc gtatgatgcg taatgaagat gacctattga 4620 atgggtcttt gatcctccaa cccatgaatg aattccatta tgcaaaattt cgaactgtat 4680 ttcaaagtca cagtaatttt cttgttctag tgcatatagt gcctgattgt aaaaccagtt 4740 gttgtaaacc agatttcctt ttgcaaagag gtgttcagca tcaacatccc tctctgttgt 4800 tactccctct ccctcaaatt ctatatttgc cttgaggaac ggatttggga aggttttgtg 4860 tgaaaacagg tgctcgtatg tctcctgtgt caccaaatca gggagtgcag actgcggttt 4920 cagccaatcc caatatggaa gtccgacaat agaaccatgt cgtttcaaag aatcttcgaa 4980 ctgaactgtg tagagtctgt gccactgagg gaaggtagcc ataccatgga ggcaacaagc 5040 gtgtctatgt gcagcagatg gacttggaca aagaggaggc aaagcgtgga aagaagcaat 5100 agcttgatat ccgtctggag aatcatctgc ttgcatgttc ttgagagctt tcctcaaaaa 5160 atgtttctcc agtgttgtaa gagagcttac ttcttttcgc acaagattat gttgtgaaag 5220 cccatctgca gtttgatgtg aatgttcacc aggaacaaaa agcagactcg gttcgttgag 5280 gatgtgtgga tcaagatgcg atccattgac agctctgacg tcaactttga tgtcgaatcc 5340 atgtccgtcc aagggaacac caagtttatg cagagcagac gttatgtcaa atttgtacaa 5400 gcggtcaaag tggaatggca tttctgtttc tccccccaga atcgtgaagg aacccgcctt 5460 tgtcttgcag ttttgttctc ctccttgttc aacgcaaata taaatatcta caacaacaga 5520 tgtcccaatg ttgtgaagaa taaagccagc aaaaactctg tcgtgttgtt ttctagcctg 5580 aatagtttgg tctagctttt gtatgctgaa gtggttaaat tgaaggttat catattggta 5640 cctgaagcgg ttttgataat caaaactgtc ttgtgggagg ttgtgtttaa gcgtcatgga 5700 atcactgtta agttctgggt tattaaatgg ttgaagaggt ttacgcatct cgttgactgc 5760 gcagtcagcc tcattgtatg gtttctttct atatctctgc aagtcctgcc agatggccca 5820 gattctgtca acgtttgcat ggtgaaggaa aaatacagga tcaaatgcgg tataatcaag 5880 agacgaaagg gagtatttag cccttcctcc aagtctagaa tgaagggcgt tgtgtatcag 5940 ctcaagctga atctcaaagt cacagaagtt gtcttgctca agagccagca tgacattctc 6000 gtagaagtcc ttattgtccc atagttctgg ctgaggatct ctggacgtca cagcattctc 6060 gaaggcgata tgtcccctga agaaaggatt aggatcaaag gtctgggatc gtgaattgaa 6120 ataagttgca tcattgatta atgatggcag agagtcagct ttctcggtcc agtcccagta 6180 aggaacagca actgcagacc ctcgttctaa gagcgcattc tccacctgaa gaacgtacag 6240 tctgtgccag tggggaaagg tgggcatgcc atggacacaa caagcaacag gatgtccatc 6300 atattcacaa agtcctggtt ttccatggaa ctttgcaatg ttttcatata taccctcttt 6360 ttgaatttgg aggaaagcag atctgatgct ctcaatttct ccctctgaga ggtcccggat 6420 attttttctt atgtggctag cgcttgttac tgcctcacga tattcatcag cctggtgtga 6480 acctgcagtt ggctcgaaaa tcaccgaagg ctgagaaaag atacttccac caagacttcc 6540 gccattcaga tcaaaggctt catatttaag aaagaaatta tctccaacat gtagatctaa 6600 atcgtgcagc tgctgtgtta tctcgtactt gtacacacgg tcatacctcc aaggcatctc 6660 agcttcatct cccaaaatgt agaattctcc agcataatgg tcacagtcag atacattgtg 6720 tttgcaaacg tagaatttca cgagtgcaga ctgtccaatt tcatgaagaa ggaatcctgc 6780 aaagactctg tcatgtgact ttcttctctg caactctcgg tccagttggg caatagacag 6840 accattaaat gccaggctct catactcata gtcgaagttg ttcttgtagt cgaagactcg 6900 gaaaggaacc gaatgttcac gagtttcgtc atctggattt atgacactat caagaccaaa 6960 tggctgaagt ggttttctca tggatgcaat ggcacagttt gcagtgttgt atggtttccc 7020 ccggtatttt tgcaaagctt gccaaatggc ccaaagtcgg tctgtattgg agtggtgcaa 7080 gaagaagatt ggatcgtatg tagtatacct caaagatgac atactgtatg gttcgttacc 7140 accaactaga gcatgtatga aattatgggc aatttcaaat tgtacctcaa aatcacagaa 7200 atcgtcctgc tcaaaggcca gaaggacttg ttttgcaata tctgtgtagt gaccgaaacc 7260 aggagattga taaagatcat cacgcacagc tcttgtagta tcatgcccaa cagaatctat 7320 gtgaccatga taccatgggt tatcttcttc gaggctggaa acagaatcat tgtaagtagg 7380 atcagcaaca aaatgtggaa gggcgctcat tggtcgagtc cagtcccaat agggcaatcc 7440 accagtaaat ccatgtttcc tcagagcatt ttctccttga actgtcagca atctgtgcca 7500 gtgagggaaa acagccattc cgtgaacgca gcatgcaaat ttcttctccg cttcagggcg 7560 tgggcaccat tttggttctc cgtgaaatgc tgcaatctgt tggaatccac ctcgtgtttt 7620 gtcatcttgg agatctgcca tagctcgcct caacgaattt tgttcttcag ctgtcaggga 7680 atgaacgttc ttcctgattt ttttgctgtg gccaaagtct ttggcaacaa tatgcccagg 7740 ttcaaacatg agagcaggag acggaattag ctctggtgac aaggaagttc catcaacatc 7800 agtgatctta acagaaatag tggcatcatt cacatcaatg ttcagctttt tggcagcatc 7860 agtaatttca tatttgtaca ttctgtcgaa agcccatttc atttccttag atccaccaag 7920 cactgcaaag cttccaatcg tgtgggcctc ttggcccttt gcactgataa ggacgtctac 7980 gtttgctgat gttccaatat gagaaaggaa gaatcctgca aaacttctgt catgctgttt 8040 ctgcttgtgt atatattttt caagttctga cagggtcatg ccaccaaact ggaggtcatc 8100 atatcgatag tccaaaactc cctcatagtc gtaaacatta gttggaacag agttttggta 8160 ggttttatcg ttgttgttga gaggtgaact gaatgaaaaa ggctttaatg gttccctttc 8220 ctgtgaccag gcacaatggg ccttgtaagg tttgttcctc ctgatttgaa gagcttgcca 8280 aatggcccag agacgatcaa catttgagtg atggaggtag aaaataggat caaaagcagt 8340 gtagtgaaga gaagacatgg agtacttttc acttcctcca acccacgcat gtatgttatt 8400 gtgaacgagt tcaaactgaa cttcaaagtc gcaaaagtct tcttgttcaa aagcatagac 8460 catagcgtca aacagatgcg tgtggtgtcc aaaagacggt ttctcaaaga gtctattgtc 8520 tgtatctctg tgtgtgacgt tattttcaaa agccacttcg gcatggtaaa atggatttgc 8580 catgattgcg ctagaaactg gatggggata ggtctcactc atagccagtt ctggcagatg 8640 atccattggc atggtccagt cccagtaagg tagaccgata gtagacccgc gacgaagtag 8700 agcatcttca acttgagtga caaagagtct atgccagtgg gggaaggttg ccatgccatg 8760 aacgcagcat gcaaatctat ctttagcatc tgggcgtgga catcttgctg gaagggcatg 8820 gaattctact gtagcttggt acccgtcaac tgagtcattc tggaatcttt ccatagcttt 8880 cctcaagctg tatgtttcct ctgcagttag atcgttgata ttctttctga cagccaggtt 8940 agcttgagca actggaactg gaggcttatc aaaatgacct ttagctggtc ggtgcactaa 9000 ggtgggttgt cgaagaatgt catttgacaa agcgcttccg ttgacgctga ataactcgga 9060 cttcacgtaa taatgcccat gaccatcatc atctatagaa gcaacgctgt cagtaacatc 9120 aaagtggtat gccctgtaga atccccactt catttcagtt ggtccgccaa gaaggaagaa 9180 atctcctgct ttgatacatt gatcatcagc gatttcatca tgagcaggtc ggcagacata 9240 gactacaaca ttggcagagc caccaaaacc agataatatg aagctggcaa atgtacgctc 9300 cttagctgct cgttctccta gatatacgtt gagttcctct ggactcatgc cattcaagga 9360 gaggtcgtca tactggtagc cgaggcgccc gtagtcaaag ctatcagcag gtttggcgtg 9420 ctctcttgtc agatccagtg gattagtgtc tctgtcgaac ggttccattg gagtgtgaat 9480 aaggttatgg gcacagtcag cactgttggg atccttgcct ctgagctgtt gaagtcgttg 9540 ccatatagca aagatacggt cagtgttgaa gtgatgtaaa aagaacaaag gatcgtatga 9600 cgtgtactcg aggtgagaca tggaatatgt gtgtctgccc ccaactagat aatgtatggc 9660 gttgtgagcc agctcaaact ggatctcaaa tttgcaaaac tcgtcctgtt ccagagcgtc 9720 aaggattccc tccatgaggc gagtgtggtg tcctggctgt actttttcaa aaagacggtc 9780 atctacagca cgacttgtct tcttgttgag gaaagtgatg tcagctcggt accaagaatt 9840 gtcatgagcc ttgcccccct cagtgtcgac aaacagtgga tgagacacaa gttctggaag 9900 atgggtgagc tcactagtcc aatcccagta ggggatgctg atggttgcct tcttccttat 9960 cagcgctctt tccaactgga caacgtagag acgatgccaa aggggaaagg atggcatacc 10020 gtgaagacaa caggccacat ttttttcatg tttgtccaca cacattgttg gaacaccatg 10080 gtagcctgct atggcctgat aaccctggtt gctgtcgtct tgctggagat catcgagagc 10140 cttctcgagg gctagaacct cgtcactgct caaactgtcc acattctttc tcaccactgt 10200 gtctaccgag aaggctccct caaacaagag ggtcgccgtg aggagagcca agatggtcca 10260 cat 10263 9 3420 PRT Megathura crenulata SEQ ID NO9 is the deduced amino acid sequence of SEQ ID NO7 9 Met Trp Thr Ile Leu Ala Leu Leu Thr Ala Thr Leu Leu Phe Glu Gly 1 5 10 15 Ala Phe Ser Val Asp Thr Val Val Arg Lys Asn Val Asp Ser Leu Ser 20 25 30 Ser Asp Glu Val Leu Ala Leu Glu Lys Ala Leu Asp Asp Leu Gln Gln 35 40 45 Asp Asp Ser Asn Gln Gly Tyr Gln Ala Ile Ala Gly Tyr His Gly Val 50 55 60 Pro Thr Met Cys Val Asp Lys His Glu Lys Asn Val Ala Cys Cys Leu 65 70 75 80 His Gly Met Pro Ser Phe Pro Leu Trp His Arg Leu Tyr Val Val Gln 85 90 95 Leu Glu Arg Ala Leu Ile Arg Lys Lys Ala Thr Ile Ser Ile Pro Tyr 100 105 110 Trp Asp Trp Thr Ser Glu Leu Thr His Leu Pro Glu Leu Val Ser His 115 120 125 Pro Leu Phe Val Asp Thr Glu Gly Gly Lys Ala His Asp Asn Ser Trp 130 135 140 Tyr Arg Ala Asp Ile Thr Phe Leu Asn Lys Lys Thr Ser Arg Ala Val 145 150 155 160 Asp Asp Arg Leu Phe Glu Lys Val Gln Pro Gly His His Thr Arg Leu 165 170 175 Met Glu Gly Ile Leu Asp Ala Leu Glu Gln Asp Glu Phe Cys Lys Phe 180 185 190 Glu Ile Gln Phe Glu Leu Ala His Asn Ala Ile His Tyr Leu Val Gly 195 200 205 Gly Arg His Thr Tyr Ser Met Ser His Leu Glu Tyr Thr Ser Tyr Asp 210 215 220 Pro Leu Phe Phe Leu His His Phe Asn Thr Asp Arg Ile Phe Ala Ile 225 230 235 240 Trp Gln Arg Leu Gln Gln Leu Arg Gly Lys Asp Pro Asn Ser Ala Asp 245 250 255 Cys Ala His Asn Leu Ile His Thr Pro Met Glu Pro Phe Asp Arg Asp 260 265 270 Thr Asn Pro Leu Asp Leu Thr Arg Glu His Ala Lys Pro Ala Asp Ser 275 280 285 Phe Asp Tyr Gly Arg Leu Gly Tyr Gln Tyr Asp Asp Leu Ser Leu Asn 290 295 300 Gly Met Ser Pro Glu Glu Leu Asn Val Tyr Leu Gly Glu Arg Ala Ala 305 310 315 320 Lys Glu Arg Thr Phe Ala Ser Phe Ile Leu Ser Gly Phe Gly Gly Ser 325 330 335 Ala Asn Val Val Val Tyr Val Cys Arg Pro Ala His Asp Glu Ile Ala 340 345 350 Asp Asp Gln Cys Ile Lys Ala Gly Asp Phe Phe Leu Leu Gly Gly Pro 355 360 365 Thr Glu Met Lys Trp Gly Phe Tyr Arg Ala Tyr His Phe Asp Val Thr 370 375 380 Asp Ser Val Ala Ser Ile Asp Asp Asp Gly His Gly His Tyr Tyr Val 385 390 395 400 Lys Ser Glu Leu Phe Ser Val Asn Gly Ser Ala Leu Ser Asn Asp Ile 405 410 415 Leu Arg Gln Pro Thr Leu Val His Arg Pro Ala Lys Gly His Phe Asp 420 425 430 Lys Pro Pro Val Pro Val Ala Gln Ala Asn Leu Ala Val Arg Lys Asn 435 440 445 Ile Asn Asp Leu Thr Ala Glu Glu Thr Tyr Ser Leu Arg Lys Ala Met 450 455 460 Glu Arg Phe Gln Asn Asp Ser Val Asp Gly Tyr Gln Ala Thr Val Glu 465 470 475 480 Phe His Ala Leu Pro Ala Arg Cys Pro Arg Pro Asp Ala Lys Asp Arg 485 490 495 Phe Ala Cys Cys Val His Gly Met Ala Thr Phe Pro His Trp His Arg 500 505 510 Leu Phe Val Thr Gln Val Glu Asp Ala Leu Leu Arg Arg Gly Ser Thr 515 520 525 Ile Gly Leu Pro Tyr Trp Asp Trp Thr Met Pro Met Asp His Leu Pro 530 535 540 Glu Leu Ala Met Ser Glu Thr Tyr Pro His Pro Val Ser Ser Ala Ile 545 550 555 560 Met Ala Asn Pro Phe Tyr His Ala Glu Val Ala Phe Glu Asn Asn Val 565 570 575 Thr His Arg Asp Thr Asp Asn Arg Leu Phe Glu Lys Pro Ser Phe Gly 580 585 590 His His Thr His Leu Phe Asp Ala Met Val Tyr Ala Phe Glu Gln Glu 595 600 605 Asp Phe Cys Asp Phe Glu Val Gln Phe Glu Leu Val His Asn Asn Ile 610 615 620 His Ala Trp Val Gly Gly Ser Glu Lys Tyr Ser Met Ser Ser Leu His 625 630 635 640 Tyr Thr Ala Phe Asp Pro Ile Phe Tyr Leu His His Ser Asn Val Asp 645 650 655 Arg Leu Trp Ala Ile Trp Gln Ala Leu Gln Ile Arg Arg Asn Lys Pro 660 665 670 Tyr Lys Ala His Cys Ala Trp Ser Gln Glu Arg Glu Pro Leu Lys Pro 675 680 685 Phe Ser Phe Ser Ser Pro Leu Asn Asn Asn Asp Lys Thr Tyr Gln Asn 690 695 700 Ser Val Pro Thr Asn Val Tyr Asp Tyr Glu Gly Val Leu Asp Tyr Arg 705 710 715 720 Tyr Asp Asp Leu Gln Phe Gly Gly Met Thr Leu Ser Glu Leu Glu Lys 725 730 735 Tyr Ile His Lys Gln Lys Gln His Asp Arg Ser Phe Ala Gly Phe Phe 740 745 750 Leu Ser His Ile Gly Thr Ser Ala Asn Val Asp Val Leu Ile Ser Ala 755 760 765 Lys Gly Gln Glu Ala His Thr Ile Gly Ser Phe Ala Val Leu Gly Gly 770 775 780 Ser Lys Glu Met Lys Trp Ala Phe Asp Arg Met Tyr Lys Tyr Glu Ile 785 790 795 800 Thr Asp Ala Ala Lys Lys Leu Asn Ile Asp Val Asn Asp Ala Thr Ile 805 810 815 Ser Val Lys Ile Thr Asp Val Asp Gly Thr Ser Leu Ser Pro Glu Leu 820 825 830 Ile Pro Ser Pro Ala Leu Met Phe Glu Pro Gly His Ile Val Ala Lys 835 840 845 Asp Phe Gly His Ser Lys Lys Ile Arg Lys Asn Val His Ser Leu Thr 850 855 860 Ala Glu Glu Gln Asn Ser Leu Arg Arg Ala Met Ala Asp Leu Gln Asp 865 870 875 880 Asp Lys Thr Arg Gly Gly Phe Gln Gln Ile Ala Ala Phe His Gly Glu 885 890 895 Pro Lys Trp Cys Pro Arg Pro Glu Ala Glu Lys Lys Phe Ala Cys Cys 900 905 910 Val His Gly Met Ala Val Phe Pro His Trp His Arg Leu Leu Thr Val 915 920 925 Gln Gly Glu Asn Ala Leu Arg Lys His Gly Phe Thr Gly Gly Leu Pro 930 935 940 Tyr Trp Asp Trp Thr Arg Pro Met Ser Ala Leu Pro His Phe Val Ala 945 950 955 960 Asp Pro Thr Tyr Asn Asp Ser Val Ser Ser Leu Glu Glu Asp Asn Pro 965 970 975 Trp Tyr His Gly His Ile Asp Ser Val Gly His Asp Thr Thr Arg Ala 980 985 990 Val Arg Asp Asp Leu Tyr Gln Ser Pro Gly Phe Gly His Tyr Thr Asp 995 1000 1005 Ile Ala Lys Gln Val Leu Leu Ala Phe Glu Gln Asp Asp Phe Cys 1010 1015 1020 Asp Phe Glu Val Gln Phe Glu Ile Ala His Asn Phe Ile His Ala 1025 1030 1035 Leu Val Gly Gly Asn Glu Pro Tyr Ser Met Ser Ser Leu Arg Tyr 1040 1045 1050 Thr Thr Tyr Asp Pro Ile Phe Phe Leu His His Ser Asn Thr Asp 1055 1060 1065 Arg Leu Trp Ala Ile Trp Gln Ala Leu Gln Lys Tyr Arg Gly Lys 1070 1075 1080 Pro Tyr Asn Thr Ala Asn Cys Ala Ile Ala Ser Met Arg Lys Pro 1085 1090 1095 Leu Gln Pro Phe Gly Leu Asp Ser Val Ile Asn Pro Asp Asp Glu 1100 1105 1110 Thr Arg Glu His Ser Val Pro Phe Arg Val Phe Asp Tyr Lys Asn 1115 1120 1125 Asn Phe Asp Tyr Glu Tyr Glu Ser Leu Ala Phe Asn Gly Leu Ser 1130 1135 1140 Ile Ala Gln Leu Asp Arg Glu Leu Gln Arg Arg Lys Ser His Asp 1145 1150 1155 Arg Val Phe Ala Gly Phe Leu Leu His Glu Ile Gly Gln Ser Ala 1160 1165 1170 Leu Val Lys Phe Tyr Val Cys Lys His Asn Val Ser Asp Cys Asp 1175 1180 1185 His Tyr Ala Gly Glu Phe Tyr Ile Leu Gly Asp Glu Ala Glu Met 1190 1195 1200 Pro Trp Arg Tyr Asp Arg Val Tyr Lys Tyr Glu Ile Thr Gln Gln 1205 1210 1215 Leu His Asp Leu Asp Leu His Val Gly Asp Asn Phe Phe Leu Lys 1220 1225 1230 Tyr Glu Ala Phe Asp Leu Asn Gly Gly Ser Leu Gly Gly Ser Ile 1235 1240 1245 Phe Ser Gln Pro Ser Val Ile Phe Glu Pro Thr Ala Gly Ser His 1250 1255 1260 Gln Ala Asp Glu Tyr Arg Glu Ala Val Thr Ser Ala Ser His Ile 1265 1270 1275 Arg Lys Asn Ile Arg Asp Leu Ser Glu Gly Glu Ile Glu Ser Ile 1280 1285 1290 Arg Ser Ala Phe Leu Gln Ile Gln Lys Glu Gly Ile Tyr Glu Asn 1295 1300 1305 Ile Ala Lys Phe His Gly Lys Pro Gly Leu Cys Glu Tyr Asp Gly 1310 1315 1320 His Pro Val Ala Cys Cys Val His Gly Met Pro Thr Phe Pro His 1325 1330 1335 Trp His Arg Leu Tyr Val Leu Gln Val Glu Asn Ala Leu Leu Glu 1340 1345 1350 Arg Gly Ser Ala Val Ala Val Pro Tyr Trp Asp Trp Thr Glu Lys 1355 1360 1365 Ala Asp Ser Leu Pro Ser Leu Ile Asn Asp Ala Thr Tyr Phe Asn 1370 1375 1380 Ser Arg Ser Gln Thr Phe Asp Pro Asn Pro Phe Phe Arg Gly His 1385 1390 1395 Ile Ala Phe Glu Asn Ala Val Thr Ser Arg Asp Pro Gln Pro Glu 1400 1405 1410 Leu Trp Asp Asn Lys Asp Phe Tyr Glu Asn Val Met Leu Ala Leu 1415 1420 1425 Glu Gln Asp Asn Phe Cys Asp Phe Glu Ile Gln Leu Glu Leu Ile 1430 1435 1440 His Asn Ala Leu His Ser Arg Leu Gly Gly Arg Ala Lys Tyr Ser 1445 1450 1455 Leu Ser Ser Leu Asp Tyr Thr Ala Phe Asp Pro Val Phe Phe Leu 1460 1465 1470 His His Ala Asn Val Asp Arg Ile Trp Ala Ile Trp Gln Asp Leu 1475 1480 1485 Gln Arg Tyr Arg Lys Lys Pro Tyr Asn Glu Ala Asp Cys Ala Val 1490 1495 1500 Asn Glu Met Arg Lys Pro Leu Gln Pro Phe Asn Asn Pro Glu Leu 1505 1510 1515 Asn Ser Asp Ser Met Thr Leu Lys His Asn Leu Pro Gln Asp Ser 1520 1525 1530 Phe Asp Tyr Gln Asn Arg Phe Arg Tyr Gln Tyr Asp Asn Leu Gln 1535 1540 1545 Phe Asn His Phe Ser Ile Gln Lys Leu Asp Gln Thr Ile Gln Ala 1550 1555 1560 Arg Lys Gln His Asp Arg Val Phe Ala Gly Phe Ile Leu His Asn 1565 1570 1575 Ile Gly Thr Ser Val Val Val Asp Ile Tyr Ile Cys Val Glu Gln 1580 1585 1590 Gly Gly Glu Gln Asn Cys Lys Thr Lys Ala Gly Ser Phe Thr Ile 1595 1600 1605 Leu Gly Gly Glu Thr Glu Met Pro Phe His Phe Asp Arg Leu Tyr 1610 1615 1620 Lys Phe Asp Ile Thr Ser Ala Leu His Lys Leu Gly Val Pro Leu 1625 1630 1635 Asp Gly His Gly Phe Asp Ile Lys Val Asp Val Arg Ala Val Asn 1640 1645 1650 Gly Ser His Leu Asp Pro His Ile Leu Asn Glu Pro Ser Leu Leu 1655 1660 1665 Phe Val Pro Gly Glu His Ser His Gln Thr Ala Asp Gly Leu Ser 1670 1675 1680 Gln His Asn Leu Val Arg Lys Glu Val Ser Ser Leu Thr Thr Leu 1685 1690 1695 Glu Lys His Phe Leu Arg Lys Ala Leu Lys Asn Met Gln Ala Asp 1700 1705 1710 Asp Ser Pro Asp Gly Tyr Gln Ala Ile Ala Ser Phe His Ala Leu 1715 1720 1725 Pro Pro Leu Cys Pro Ser Pro Ser Ala Ala His Arg His Ala Cys 1730 1735 1740 Cys Leu His Gly Met Ala Thr Phe Pro Gln Trp His Arg Leu Tyr 1745 1750 1755 Thr Val Gln Phe Glu Asp Ser Leu Lys Arg His Gly Ser Ile Val 1760 1765 1770 Gly Leu Pro Tyr Trp Asp Trp Leu Lys Pro Gln Ser Ala Leu Pro 1775 1780 1785 Asp Leu Val Thr Gln Glu Thr Tyr Glu His Leu Phe Ser His Lys 1790 1795 1800 Thr Phe Pro Asn Pro Phe Leu Lys Ala Asn Ile Glu Phe Glu Gly 1805 1810 1815 Glu Gly Val Thr Thr Glu Arg Asp Val Asp Ala Glu His Leu Phe 1820 1825 1830 Ala Lys Gly Asn Leu Val Tyr Asn Asn Trp Phe Tyr Asn Gln Ala 1835 1840 1845 Leu Tyr Ala Leu Glu Gln Glu Asn Tyr Cys Asp Phe Glu Ile Gln 1850 1855 1860 Phe Glu Ile Leu His Asn Gly Ile His Ser Trp Val Gly Gly Ser 1865 1870 1875 Lys Thr His Ser Ile Gly His Leu His Tyr Ala Ser Tyr Asp Pro 1880 1885 1890 Leu Phe Tyr Ile His His Ser Gln Thr Asp Arg Ile Trp Ala Ile 1895 1900 1905 Trp Gln Ala Phe Gln Glu His Arg Gly Leu Ser Gly Lys Glu Ala 1910 1915 1920 His Cys Ala Leu Glu Gln Met Lys Asp Pro Leu Lys Pro Phe Ser 1925 1930 1935 Phe Gly Ser Pro Tyr Asn Leu Asn Lys Arg Thr Gln Glu Phe Ser 1940 1945 1950 Lys Pro Glu Asp Thr Phe Asp Tyr His Arg Phe Gly Tyr Glu Tyr 1955 1960 1965 Asp Ser Leu Glu Phe Val Gly Met Ser Val Ser Ser Leu His Asn 1970 1975 1980 Tyr Ile Lys Gln Gln Gln Glu Ala Asp Arg Val Phe Ala Gly Phe 1985 1990 1995 Leu Leu Lys Gly Phe Gly Gln Ser Ala Ser Val Ser Phe Asp Ile 2000 2005 2010 Cys Arg Pro Asp Gln Ser Cys Gln Glu Ala Gly Tyr Phe Ser Val 2015 2020 2025 Leu Gly Gly Ser Ser Glu Met Pro Trp Gln Phe Asp Arg Leu Tyr 2030 2035 2040 Lys Tyr Asp Ile Thr Lys Thr Leu Lys Asp Met Lys Leu Arg Tyr 2045 2050 2055 Asp Asp Ala Phe Thr Ile Lys Val His Ile Lys Asp Ile Ala Gly 2060 2065 2070 Ala Glu Leu Asp Ser Asp Leu Ile Pro Thr Pro Ser Val Leu Phe 2075 2080 2085 Glu Glu Gly Lys His Gly Ile Asn Val Arg His Val Gly Arg Asn 2090 2095 2100 Arg Ile Arg Met Glu Leu Ser Glu Leu Thr Glu Arg Asp Leu Ala 2105 2110 2115 Ser Leu Lys Ser Ala Met Arg Ser Leu Gln Ala Asp Asp Gly Val 2120 2125 2130 Asn Gly Tyr Gln Ala Ile Ala Ser Phe His Gly Leu Pro Ala Ser 2135 2140 2145 Cys His Asp Asp Glu Gly His Glu Ile Ala Cys Cys Ile His Gly 2150 2155 2160 Met Pro Val Phe Pro His Trp His Arg Leu Tyr Thr Leu Gln Met 2165 2170 2175 Asp Met Ala Leu Leu Ser His Gly Ser Ala Val Ala Ile Pro Tyr 2180 2185 2190 Trp Asp Trp Thr Lys Pro Ile Ser Lys Leu Pro Asp Leu Phe Thr 2195 2200 2205 Ser Pro Glu Tyr Tyr Asp Pro Trp Arg Asp Ala Val Val Asn Asn 2210 2215 2220 Pro Phe Ala Lys Gly Tyr Ile Lys Ser Glu Asp Ala Tyr Thr Val 2225 2230 2235 Arg Asp Pro Gln Asp Ile Leu Tyr His Leu Gln Asp Glu Thr Gly 2240 2245 2250 Thr Ser Val Leu Leu Asp Gln Thr Leu Leu Ala Leu Glu Gln Thr 2255 2260 2265 Asp Phe Cys Asp Phe Glu Val Gln Phe Glu Val Val His Asn Ala 2270 2275 2280 Ile His Tyr Leu Val Gly Gly Arg Gln Val Tyr Ala Leu Ser Ser 2285 2290 2295 Leu His Tyr Ala Ser Tyr Asp Pro Ala Phe Phe Ile His His Ser 2300 2305 2310 Phe Val Asp Lys Ile Trp Ala Val Trp Gln Ala Leu Gln Lys Lys 2315 2320 2325 Arg Lys Arg Pro Tyr His Lys Ala Asp Cys Ala Leu Asn Met Met 2330 2335 2340 Thr Lys Pro Met Arg Pro Phe Ala His Asp Phe Asn His Asn Gly 2345 2350 2355 Phe Thr Lys Met His Ala Val Pro Asn Thr Leu Phe Asp Phe Gln 2360 2365 2370 Asp Leu Phe Tyr Thr Tyr Asp Asn Leu Glu Ile Ala Gly Met Asn 2375 2380 2385 Val Asn Gln Leu Glu Ala Glu Ile Asn Arg Arg Lys Ser Gln Thr 2390 2395 2400 Arg Val Phe Ala Gly Phe Leu Leu His Gly Ile Gly Arg Ser Ala 2405 2410 2415 Asp Val Arg Phe Trp Ile Cys Lys Thr Ala Asp Asp Cys His Ala 2420 2425 2430 Ser Gly Met Ile Phe Ile Leu Gly Gly Ser Lys Glu Met His Trp 2435 2440 2445 Ala Tyr Asp Arg Asn Phe Lys Tyr Asp Ile Thr Gln Ala Leu Lys 2450 2455 2460 Ala Gln Ser Ile His Pro Glu Asp Val Phe Asp Thr Asp Ala Pro 2465 2470 2475 Phe Phe Ile Lys Val Glu Ile His Gly Val Asn Lys Thr Ala Leu 2480 2485 2490 Pro Ser Ser Ala Ile Pro Ala Pro Thr Ile Ile Tyr Ser Ala Gly 2495 2500 2505 Glu Gly His Thr Asp Asp His Gly Ser Asp His Ile Ala Gly Ser 2510 2515 2520 Gly Val Arg Lys Asp Val Thr Ser Leu Thr Ala Ser Glu Ile Glu 2525 2530 2535 Asn Leu Arg His Ala Leu Gln Ser Val Met Asp Asp Asp Gly Pro 2540 2545 2550 Asn Gly Phe Gln Ala Ile Ala Ala Tyr His Gly Ser Pro Pro Met 2555 2560 2565 Cys His Met Pro Asp Gly Arg Asp Val Ala Cys Cys Thr His Gly 2570 2575 2580 Met Ala Ser Phe Pro His Trp His Arg Leu Phe Val Lys Gln Met 2585 2590 2595 Glu Asp Ala Leu Ala Ala His Gly Ala His Ile Gly Ile Pro Tyr 2600 2605 2610 Trp Asp Trp Thr Ser Ala Phe Ser His Leu Pro Ala Leu Val Thr 2615 2620 2625 Asp His Glu His Asn Pro Phe His His Gly His Ile Ala His Arg 2630 2635 2640 Asn Val Asp Thr Ser Arg Ser Pro Arg Asp Met Leu Phe Asn Asp 2645 2650 2655 Pro Glu His Gly Ser Glu Ser Phe Phe Tyr Arg Gln Val Leu Leu 2660 2665 2670 Ala Leu Glu Gln Thr Asp Phe Cys Gln Phe Glu Val Gln Phe Glu 2675 2680 2685 Ile Thr His Asn Ala Ile His Ser Trp Thr Gly Gly His Thr Pro 2690 2695 2700 Tyr Gly Met Ser Ser Leu Glu Tyr Thr Ala Tyr Asp Pro Leu Phe 2705 2710 2715 Tyr Leu His His Ser Asn Thr Asp Arg Ile Trp Ala Ile Trp Gln 2720 2725 2730 Ala Leu Gln Lys Tyr Arg Gly Phe Gln Tyr Asn Ala Ala His Cys 2735 2740 2745 Asp Ile Gln Val Leu Lys Gln Pro Leu Lys Pro Phe Ser Glu Ser 2750 2755 2760 Arg Asn Pro Asn Pro Val Thr Arg Ala Asn Ser Arg Ala Val Asp 2765 2770 2775 Ser Phe Asp Tyr Glu Arg Leu Asn Tyr Gln Tyr Asp Thr Leu Thr 2780 2785 2790 Phe His Gly His Ser Ile Ser Glu Leu Asp Ala Met Leu Gln Glu 2795 2800 2805 Arg Lys Lys Glu Glu Arg Thr Phe Ala Ala Phe Leu Leu His Gly 2810 2815 2820 Phe Gly Ala Ser Ala Asp Val Ser Phe Asp Val Cys Thr Pro Asp 2825 2830 2835 Gly His Cys Ala Phe Ala Gly Thr Phe Ala Val Leu Gly Gly Glu 2840 2845 2850 Leu Glu Met Pro Trp Ser Phe Glu Arg Leu Phe Arg Tyr Asp Ile 2855 2860 2865 Thr Lys Val Leu Lys Gln Met Asn Leu Tyr Tyr Asp Ser Glu Phe 2870 2875 2880 His Phe Ala Leu Lys Val Val Gly Thr Asp Gly Thr Glu Leu Pro 2885 2890 2895 Ser Asp Ser Ile Lys Ser Pro Thr Ile Glu His His Gln Ala Asp 2900 2905 2910 His Gln Gly His Gly Gly Glu Gly Gly Gln His Pro Glu His Thr 2915 2920 2925 Gly Phe Val Arg Lys Asp Ile Asp Ser Leu Ser Leu Asp Glu Ala 2930 2935 2940 Asn Asp Leu Lys Asn Ala Leu Tyr Lys Leu Gln Asn Asp Gln Ser 2945 2950 2955 Leu Ser Gly Tyr Glu Ser Ile Ala Gly Phe His Gly Tyr Pro Tyr 2960 2965 2970 Leu Cys Pro Asp Asn Gly Asp Asp Lys Tyr Ala Cys Cys Val His 2975 2980 2985 Gly Met Pro Val Phe Pro His Trp His Arg Leu Leu Thr Ile Gln 2990 2995 3000 Phe Glu Arg Ala Leu Lys Glu His Gly Ser His Leu Gly Val Pro 3005 3010 3015 Tyr Trp Asp Trp Thr Lys Ser Met Thr Ala Leu Pro Ala Phe Phe 3020 3025 3030 Ala Asp Ser Ser Asn Asn Asn Pro Phe Tyr Lys Tyr His Ile Met 3035 3040 3045 Lys Ala Gly His Asp Thr Thr Arg Gln Pro Asn Asp Leu Leu Phe 3050 3055 3060 Asn Gln Pro Gln Val His Gly Tyr Asp Tyr Leu Tyr Tyr Leu Ala 3065 3070 3075 Leu Ser Thr Leu Glu Glu Asp Asn Tyr Cys Asp Phe Glu Val Gln 3080 3085 3090 Tyr Glu Asn Phe His Asn Ala Val His Ile Trp Leu Gly Gly Thr 3095 3100 3105 Glu Thr Tyr Ser Met Ala Thr Leu Glu Tyr Ser Ala Phe Asp Pro 3110 3115 3120 Val Phe Met Ile Leu His Ser Gly Leu Asp Arg Leu Trp Ile Ile 3125 3130 3135 Trp Gln Lys Leu Gln Lys Leu Arg Lys Lys Pro Tyr Asn Ala Ala 3140 3145 3150 Lys Cys Ala Gly His Met Ile Asp Glu Pro Leu His Pro Phe Asp 3155 3160 3165 Tyr Glu Ser Val Asn Gln Asp Gly Phe Thr Arg Ala Asn Ala Lys 3170 3175 3180 Pro Thr Thr Val Phe Asp Ser His Arg Phe Asp Tyr His Tyr Asp 3185 3190 3195 Asn Leu Asp Val Arg Gly Lys Ser Ile Gln Glu Ile Asn Ala Ile 3200 3205 3210 Ile His Asp Leu Arg Thr Gln Pro Arg Ile Phe Ala Gly Phe Val 3215 3220 3225 Leu Ser Gly Ile Tyr Thr Ser Ala Asn Val Lys Ile Tyr Leu Ile 3230 3235 3240 Arg Glu Gly His Asp Asp Glu Tyr Val Gly Thr Phe Ala Val Leu 3245 3250 3255 Gly Gly Ala Lys Glu Met Pro Trp Ala Tyr Glu Arg Ile Phe Lys 3260 3265 3270 Tyr Asp Ile Thr Asp Val Ala His Lys Leu Lys Met His His Asp 3275 3280 3285 Asp Thr Phe Asn Phe Arg Leu Glu Ile Gln Ser Tyr Ser Gly Glu 3290 3295 3300 Thr Leu Thr Ser His Leu Pro Glu Pro Leu Ile Ile His Arg Pro 3305 3310 3315 Ala Asn Lys Asp Tyr Asp Val Leu Ile Ile Pro Leu Gly Ser Gly 3320 3325 3330 Arg Lys Leu Pro Pro Lys Val Ile Val Lys Arg Gly Thr Glu Ile 3335 3340 3345 Glu Phe His Pro Val Asp Asp Thr Ile Asn Arg Pro Val Val Asp 3350 3355 3360 Leu Gly Ser Tyr Thr Ala Leu Tyr Asn Cys Val Val Pro Pro Phe 3365 3370 3375 Thr Tyr His Gly Tyr Glu Leu Asp His Ala Tyr Ser Leu Arg Asp 3380 3385 3390 Gly His Tyr Tyr Ile Ala Gly Pro Thr Lys Asp Leu Cys Ala Ser 3395 3400 3405 Gly Asn Val Arg Ile His Ile His Ile Glu Asp Glu 3410 3415 3420 10 10495 DNA Megathura crenulata 10 gatcaccagc catctggttt gatcatgtgg accatcttgg ctctcctcac ggcgaccctc 60 ttgtttgagg gagccttctc ggtagacaca gtggtgagaa agaatgtgga cagtttgagc 120 agtgacgagg ttctagccct cgagaaggct ctcgatgatc tccagcaaga cgacagcaac 180 cagggttatc aggccatagc aggctaccat ggtgttccaa caatgtgtgt ggacaaacat 240 gaaaaaaatg tggcctgttg tcttcacggt atgccatcct ttcccctttg gcatcgtctc 300 tacgttgtcc agttggaaag agcgctgata aggaagaagg caaccatcag catcccctac 360 tgggattgga ctagtgagct cacccatctt ccagaacttg tgtctcatcc actgtttgtc 420 gacactgagg ggggcaaggc tcatgacaat tcttggtacc gagctgacat cactttcctc 480 aacaagaaga caagtcgtgc tgtagatgac cgtctttttg aaaaagtaca gccaggacac 540 cacactcgcc tcatggaggg aatccttgac gctctggaac aggacgagtt ttgcaaattt 600 gagatccagt ttgagctggc tcacaacgcc atacattatc tagttggggg cagacacaca 660 tattccatgt ctcacctcga gtacacgtca tacgatcctt tgttcttttt acatcacttc 720 aacactgacc gtatctttgc tatatggcaa cgacttcaac agctcagagg caaggatccc 780 aacagtgctg actgtgccca taaccttatt cacactccaa tggaaccgtt cgacagagac 840 actaatccac tggatctgac aagagagcac gccaaacctg ctgatagctt tgactacggg 900 cgcctcggct accagtatga cgacctctcc ttgaatggca tgagtccaga ggaactcaac 960 gtatatctag gagaacgagc agctaaggag cgtacatttg ccagcttcat attatctggt 1020 tttggtggct ctgccaatgt tgtagtctat gtctgccgac ctgctcatga tgaaatcgct 1080 gatgatcaat gtatcaaagc aggagatttc ttccttcttg gcggaccaac tgaaatgaag 1140 tggggattct acagggcata ccactttgat gttactgaca gcgttgcttc tatagatgat 1200 gatggtcatg ggcattatta cgtgaagtcc gagttattca gcgtcaacgg aagcgctttg 1260 tcaaatgaca ttcttcgaca acccacctta gtgcaccgac cagctaaagg tcattttgat 1320 aagcctccag ttccagttgc tcaagctaac ctggctgtca gaaagaatat caacgatcta 1380 actgcagagg aaacatacag cttgaggaaa gctatggaaa gattccagaa tgactcagtt 1440 gacgggtacc aagctacagt agaattccat gcccttccag caagatgtcc acgcccagat 1500 gctaaagata gatttgcatg ctgcgttcat ggcatggcaa ccttccccca ctggcataga 1560 ctctttgtca ctcaagttga agatgctcta cttcgtcgcg ggtctactat cggtctacct 1620 tactgggact ggaccatgcc aatggatcat ctgccagaac tggctatgag tgagacctat 1680 ccccatccag tttctagcgc aatcatggca aatccatttt accatgccga agtggctttt 1740 gaaaataacg tcacacacag agatacagac aatagactct ttgagaaacc gtcttttgga 1800 caccacacgc atctgtttga cgctatggtc tatgcttttg aacaagaaga cttttgcgac 1860 tttgaagttc agtttgaact cgttcacaat aacatacatg cgtgggttgg aggaagtgaa 1920 aagtactcca tgtcttctct tcactacact gcttttgatc ctattttcta cctccatcac 1980 tcaaatgttg atcgtctctg ggccatttgg caagctcttc aaatcaggag gaacaaacct 2040 tacaaggccc attgtgcctg gtcacaggaa agggaaccat taaagccttt ttcattcagt 2100 tcacctctca acaacaacga taaaacctac caaaactctg ttccaactaa tgtttacgac 2160 tatgagggag ttttggacta tcgatatgat gacctccagt ttggtggcat gaccctgtca 2220 gaacttgaaa aatatataca caagcagaaa cagcatgaca gaagttttgc aggattcttc 2280 ctttctcata ttggaacatc agcaaacgta gacgtcctta tcagtgcaaa gggccaagag 2340 gcccacacga ttggaagctt tgcagtgctt ggtggatcta aggaaatgaa atgggctttc 2400 gacagaatgt acaaatatga aattactgat gctgccaaaa agctgaacat tgatgtgaat 2460 gatgccacta tttctgttaa gatcactgat gttgatggaa cttccttgtc accagagcta 2520 attccgtctc ctgctctcat gtttgaacct gggcatattg ttgccaaaga ctttggccac 2580 agcaaaaaaa tcaggaagaa cgttcattcc ctgacagctg aagaacaaaa ttcgttgagg 2640 cgagctatgg cagatctcca agatgacaaa acacgaggtg gattccaaca gattgcagca 2700 tttcacggag aaccaaaatg gtgcccacgc cctgaagcgg agaagaaatt tgcatgctgc 2760 gttcacggaa tggctgtttt ccctcactgg cacagattgc tgacagttca aggagaaaat 2820 gctctgagga aacatggatt tactggtgga ttgccctatt gggactggac tcgaccaatg 2880 agcgcccttc cacattttgt tgctgatcct acttacaatg attctgtttc cagcctcgaa 2940 gaagataacc catggtatca tggtcacata gattctgttg ggcatgatac tacaagagct 3000 gtgcgtgatg atctttatca atctcctggt ttcggtcact acacagatat tgcaaaacaa 3060 gtccttctgg cctttgagca ggacgatttc tgtgattttg aggtacaatt tgaaattgcc 3120 cataatttca tacatgctct agttggtggt aacgaaccat acagtatgtc atctttgagg 3180 tatactacat acgatccaat cttcttcttg caccactcca atacagaccg actttgggcc 3240 atttggcaag ctttgcaaaa ataccggggg aaaccataca acactgcaaa ctgtgccatt 3300 gcatccatga gaaaaccact tcagccattt ggtcttgata gtgtcataaa tccagatgac 3360 gaaactcgtg aacattcggt tcctttccga gtcttcgact acaagaacaa cttcgactat 3420 gagtatgaga gcctggcatt taatggtctg tctattgccc aactggaccg agagttgcag 3480 agaagaaagt cacatgacag agtctttgca ggattccttc ttcatgaaat tggacagtct 3540 gcactcgtga aattctacgt ttgcaaacac aatgtatctg actgtgacca ttatgctgga 3600 gaattctaca ttttgggaga tgaagctgag atgccttgga ggtatgaccg tgtgtacaag 3660 tacgagataa cacagcagct gcacgattta gatctacatg ttggagataa tttctttctt 3720 aaatatgaag cctttgatct gaatggcgga agtcttggtg gaagtatctt ttctcagcct 3780 tcggtgattt tcgagccaac tgcaggttca caccaggctg atgaatatcg tgaggcagta 3840 acaagcgcta gccacataag aaaaaatatc cgggacctct cagagggaga aattgagagc 3900 atcagatctg ctttcctcca aattcaaaaa gagggtatat atgaaaacat tgcaaagttc 3960 catggaaaac caggactttg tgaatatgat ggacatcctg ttgcttgttg tgtccatggc 4020 atgcccacct ttccccactg gcacagactg tacgttcttc aggtggagaa tgcgctctta 4080 gaacgagggt ctgcagttgc tgttccttac tgggactgga ccgagaaagc tgactctctg 4140 ccatcattaa tcaatgatgc aacttatttc aattcacgat cccagacctt tgatcctaat 4200 cctttcttca ggggacatat cgccttcgag aatgctgtga cgtccagaga tcctcagcca 4260 gaactatggg acaataagga cttctacgag aatgtcatgc tggctcttga gcaagacaac 4320 ttctgtgact ttgagattca gcttgagctg atacacaacg cccttcattc tagacttgga 4380 ggaagggcta aatactccct ttcgtctctt gattataccg catttgatcc tgtatttttc 4440 cttcaccatg caaacgttga cagaatctgg gccatctggc aggacttgca gagatataga 4500 aagaaaccat acaatgaggc tgactgcgca gtcaacgaga tgcgtaaacc tcttcaacca 4560 tttaataacc cagaacttaa cagtgattcc atgacgctta aacacaacct cccacaagac 4620 agttttgatt atcaaaaccg cttcaggtac caatatgata accttcaatt taaccacttc 4680 agcatacaaa agctagacca aactattcag gctagaaaac aacacgacag agtttttgct 4740 ggctttattc ttcacaacat tgggacatct gttgttgtag atatttatat ttgcgttgaa 4800 caaggaggag aacaaaactg caagacaaag gcgggttcct tcacgattct ggggggagaa 4860 acagaaatgc cattccactt tgaccgcttg tacaaatttg acataacgtc tgctctgcat 4920 aaacttggtg ttcccttgga cggacatgga ttcgacatca aagttgacgt cagagctgtc 4980 aatggatcgc atcttgatcc acacatcctc aacgaaccga gtctgctttt tgttcctggt 5040 gaacattcac atcaaactgc agatgggctt tcacaacata atcttgtgcg aaaagaagta 5100 agctctctta caacactgga gaaacatttt ttgaggaaag ctctcaagaa catgcaagca 5160 gatgattctc cagacggata tcaagctatt gcttctttcc acgctttgcc tcctctttgt 5220 ccaagtccat ctgctgcaca tagacacgct tgttgcctcc atggtatggc taccttccct 5280 cagtggcaca gactctacac agttcagttc gaagattctt tgaaacgaca tggttctatt 5340 gtcggacttc catattggga ttggctgaaa ccgcagtctg cactccctga tttggtgaca 5400 caggagacat acgagcacct gttttcacac aaaaccttcc caaatccgtt cctcaaggca 5460 aatatagaat ttgagggaga gggagtaaca acagagaggg atgttgatgc tgaacacctc 5520 tttgcaaaag gaaatctggt ttacaacaac tggttttaca atcaggcact atatgcacta 5580 gaacaagaaa attactgtga ctttgaaata cagttcgaaa ttttgcataa tggaattcat 5640 tcatgggttg gaggatcaaa gacccattca ataggtcatc ttcattacgc atcatacgat 5700 ccactgttct atatccacca ttcgcagaca gatcgcattt gggctatctg gcaagctttc 5760 caggagcaca gaggtctttc agggaaggaa gcacactgcg ccctggagca aatgaaagac 5820 cctctcaaac ctttcagctt tggaagtccc tataatttga acaaacgcac tcaagagttc 5880 tccaagcctg aagacacatt tgattatcac cgattcgggt atgagtatga ttccctcgaa 5940 tttgttggca tgtctgtttc aagtttacat aactatataa aacaacaaca ggaagctgat 6000 agagtcttcg caggattcct tcttaaagga tttggacaat cagcatccgt atcgtttgat 6060 atctgcagac cagaccagag ttgccaagaa gctggatact tctcagttct cggtggaagt 6120 tcagaaatgc cgtggcagtt tgacaggctt tacaagtacg acattacaaa aacgttgaaa 6180 gacatgaaac tgcgatacga tgacgcattt accatcaagg ttcacataaa ggatatagct 6240 ggagctgagt tggacagcga tctgattcca actccttctg ttctctttga agaaggaaag 6300 catgggatca atgtacgtca cgttggtcgt aatcggattc gtatggaact atctgaactc 6360 accgagagag atctcgccag cctgaaatct gcaatgaggt ctctacaagc tgacgatggg 6420 gtgaacggtt atcaagccat tgcatcattc cacggtctcc cggcttcttg tcatgatgat 6480 gagggacatg agattgcctg ttgtatccac ggaatgccag tattcccaca ctggcacagg 6540 ctttacaccc tgcaaatgga catggctctg ttatctcacg gatctgctgt tgctattcca 6600 tactgggact ggaccaaacc tatcagcaaa ctgcctgatc tcttcaccag ccctgaatat 6660 tacgatcctt ggagggatgc agttgtcaat aatccatttg ctaaaggcta cattaaatcc 6720 gaggacgctt acacggttag ggatcctcag gacattttgt accacttgca ggacgaaacg 6780 ggaacatctg ttttgttaga tcaaactctt ttagccttag aacagacaga tttctgtgat 6840 tttgaggttc aatttgaggt cgtccataat gctattcact acttggtggg tggtcgacaa 6900 gtttatgctc tttcttctct acactatgct tcatatgacc cagccttctt tattcatcac 6960 tcctttgttg acaaaatatg ggcagtctgg caagctctgc aaaagaagag aaagcgtccc 7020 tatcataaag cggattgtgc tcttaacatg atgaccaaac caatgcgacc atttgcacac 7080 gatttcaatc acaatggatt cacaaaaatg cacgcagtcc ccaacactct atttgacttt 7140 caggaccttt tctacacgta tgacaactta gaaattgctg gcatgaatgt taatcagttg 7200 gaagcggaaa tcaaccggcg aaaaagccaa acaagagtct ttgccgggtt ccttctacat 7260 ggcattggaa gatcagctga tgtacgattt tggatttgca agacagctga cgactgccac 7320 gcatctggca tgatctttat cttaggaggt tctaaagaga tgcactgggc ctatgacagg 7380 aactttaaat acgacatcac ccaagctttg aaggctcagt ccatacaccc tgaagatgtg 7440 tttgacactg atgctccttt cttcattaaa gtggagatcc atggtgtaaa caagactgct 7500 ctcccatctt cagctatccc agcacctact ataatctact cagctggtga aggtcatacc 7560 gatgatcacg gctcagatca tattgctggc agtggagtca ggaaagacgt gacgtctctt 7620 accgcatctg agatagagaa cctgaggcat gctctgcaaa gcgtgatgga tgatgatgga 7680 cccaatggat tccaggcaat tgctgcttat cacggaagtc ctcccatgtg tcacatgcct 7740 gatggtagag acgttgcatg ttgtactcat ggaatggcat ctttccctca ctggcacaga 7800 ctgtttgtga aacagatgga ggatgcactg gctgcgcatg gagctcacat tggcatacca 7860 tactgggatt ggacaagtgc gtttagtcat ctgcctgccc tagtgactga ccacgagcac 7920 aatcccttcc accacggaca tattgctcat cggaatgtgg atacatctcg atctccgaga 7980 gacatgctgt tcaatgaccc tgaacacggg tcagaatcat tcttctatag acaggttctc 8040 ttggctctag aacagacaga cttctgccaa tttgaagttc agtttgaaat aacacacaat 8100 gcaatccact cttggactgg aggacatact ccatatggaa tgtcatcact ggaatataca 8160 gcatatgatc cactctttta tctccaccat tccaacactg atcgtatctg ggccatctgg 8220 caggcactcc agaaatacag aggttttcaa tacaacgcag ctcattgcga tatccaggtt 8280 ctgaaacaac ctcttaaacc attcagcgag tccaggaatc caaacccagt caccagagcc 8340 aattctaggg cagtcgattc atttgattat gagagactca attatcaata tgacacactt 8400 accttccacg gacattctat ctcagaactt gatgccatgc ttcaagagag aaagaaggaa 8460 gagagaacat ttgcagcctt cctgttgcac ggatttggcg ccagtgctga tgtttcgttt 8520 gatgtctgca cacctgatgg tcattgtgcc tttgctggaa ccttcgcggt acttggtggg 8580 gagctggaga tgccctggtc ctttgaaaga ttgttccgtt atgatatcac aaaggttctc 8640 aagcagatga atctttacta tgattctgag tttcatttcg cgttgaaggt tgttggcaca 8700 gatggaacag aactgccatc ggatagtatc aagagcccaa caattgagca tcatcaagca 8760 gatcaccagg gtcatggtgg ggagggtggc caacatcctg aacatactgg atttgtaagg 8820 aaggacatag attctctttc cttggatgaa gccaatgatc tgaaaaatgc actgtacaag 8880 ctgcagaacg accaaagcct cagtggatat gaatcaatag ctggtttcca tggctatccc 8940 tacctctgtc ctgacaacgg cgatgacaag tatgcctgct gtgtccatgg aatgcctgta 9000 tttccacact ggcacagact tcttactatc cagtttgaga gagcactcaa agaacatggt 9060 tcacatttgg gtgttcccta ttgggattgg accaagtcaa tgactgctct tccagctttc 9120 tttgccgatt cttccaataa caaccctttc tacaaatatc acattatgaa agctggacat 9180 gatacaacga gacagccgaa tgatcttctc ttcaatcaac cacaggttca tggatatgac 9240 tatctatact accttgccct ttcaacacta gaagaggata actactgtga ctttgaggtt 9300 cagtatgaaa attttcacaa tgcagtacat atctggctcg ggggaactga aacctactcc 9360 atggctactc ttgaatattc agcgtttgat ccagtcttta tgatcttgca ttctggtctc 9420 gacagactct ggatcatttg gcaaaagctg caaaagctca gaaagaaacc atataatgca 9480 gccaaatgtg ctggccacat gattgatgaa cctttgcatc cttttgacta cgagtctgtt 9540 aaccaagatg gcttcactcg agcaaacgct aaacctacaa cagtgtttga cagtcacagg 9600 tttgattatc actatgacaa cctggatgta cgaggaaaga gtattcagga aataaacgcc 9660 atcatccacg atttgaggac ccaaccaaga atctttgctg gttttgttct ttctggaata 9720 tacacttcag ctaatgttaa gatatactta atcagagaag gccatgacga tgaatatgta 9780 ggaacatttg cagttttagg aggagccaag gagatgccat gggcctatga gaggattttc 9840 aagtatgaca ttacagatgt cgcccacaaa ctgaaaatgc accatgatga tacattcaac 9900 ttcagacttg aaattcaatc ttacagtggc gaaacactaa catcacatct tcctgaacct 9960 ctcataatcc acagaccagc caacaaggac tatgatgtct tgattattcc attaggaagc 10020 ggacgcaagc ttcctcctaa agtcattgtc aagagaggaa cagaaattga gtttcatcca 10080 gtcgatgaca cgatcaacag acctgttgta gatcttggca gctacactgc tctctacaac 10140 tgtgtggtgc cacccttcac ataccacgga tatgagctcg atcatgccta ctcccttagg 10200 gatgggcact actatattgc aggcccaacc aaagacctct gcgccagcgg aaatgttaga 10260 atacatatcc acattgaaga cgagtaactt tacagaaata atcatcaaca tcagactact 10320 ttcatttgat ggtcatgtcc acttcaagat gcttcatcct tccaactttg tcagtccaat 10380 tgatctactg ctgactaacc gtttatactt acattgaaac agctgtaact gcttttcagt 10440 attcaaaata aatgtttaat tttttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa 10495 11 10495 DNA Megathura crenulata SEQ ID NO11 is the complementary sequence of the sequence of SEQ ID NO10 11 tttttttttt tttttttttt tttttttttt aaaaaattaa acatttattt tgaatactga 60 aaagcagtta cagctgtttc aatgtaagta taaacggtta gtcagcagta gatcaattgg 120 actgacaaag ttggaaggat gaagcatctt gaagtggaca tgaccatcaa atgaaagtag 180 tctgatgttg atgattattt ctgtaaagtt actcgtcttc aatgtggata tgtattctaa 240 catttccgct ggcgcagagg tctttggttg ggcctgcaat atagtagtgc ccatccctaa 300 gggagtaggc atgatcgagc tcatatccgt ggtatgtgaa gggtggcacc acacagttgt 360 agagagcagt gtagctgcca agatctacaa caggtctgtt gatcgtgtca tcgactggat 420 gaaactcaat ttctgttcct ctcttgacaa tgactttagg aggaagcttg cgtccgcttc 480 ctaatggaat aatcaagaca tcatagtcct tgttggctgg tctgtggatt atgagaggtt 540 caggaagatg tgatgttagt gtttcgccac tgtaagattg aatttcaagt ctgaagttga 600 atgtatcatc atggtgcatt ttcagtttgt gggcgacatc tgtaatgtca tacttgaaaa 660 tcctctcata ggcccatggc atctccttgg ctcctcctaa aactgcaaat gttcctacat 720 attcatcgtc atggccttct ctgattaagt atatcttaac attagctgaa gtgtatattc 780 cagaaagaac aaaaccagca aagattcttg gttgggtcct caaatcgtgg atgatggcgt 840 ttatttcctg aatactcttt cctcgtacat ccaggttgtc atagtgataa tcaaacctgt 900 gactgtcaaa cactgttgta ggtttagcgt ttgctcgagt gaagccatct tggttaacag 960 actcgtagtc aaaaggatgc aaaggttcat caatcatgtg gccagcacat ttggctgcat 1020 tatatggttt ctttctgagc ttttgcagct tttgccaaat gatccagagt ctgtcgagac 1080 cagaatgcaa gatcataaag actggatcaa acgctgaata ttcaagagta gccatggagt 1140 aggtttcagt tcccccgagc cagatatgta ctgcattgtg aaaattttca tactgaacct 1200 caaagtcaca gtagttatcc tcttctagtg ttgaaagggc aaggtagtat agatagtcat 1260 atccatgaac ctgtggttga ttgaagagaa gatcattcgg ctgtctcgtt gtatcatgtc 1320 cagctttcat aatgtgatat ttgtagaaag ggttgttatt ggaagaatcg gcaaagaaag 1380 ctggaagagc agtcattgac ttggtccaat cccaataggg aacacccaaa tgtgaaccat 1440 gttctttgag tgctctctca aactggatag taagaagtct gtgccagtgt ggaaatacag 1500 gcattccatg gacacagcag gcatacttgt catcgccgtt gtcaggacag aggtagggat 1560 agccatggaa accagctatt gattcatatc cactgaggct ttggtcgttc tgcagcttgt 1620 acagtgcatt tttcagatca ttggcttcat ccaaggaaag agaatctatg tccttcctta 1680 caaatccagt atgttcagga tgttggccac cctccccacc atgaccctgg tgatctgctt 1740 gatgatgctc aattgttggg ctcttgatac tatccgatgg cagttctgtt ccatctgtgc 1800 caacaacctt caacgcgaaa tgaaactcag aatcatagta aagattcatc tgcttgagaa 1860 cctttgtgat atcataacgg aacaatcttt caaaggacca gggcatctcc agctccccac 1920 caagtaccgc gaaggttcca gcaaaggcac aatgaccatc aggtgtgcag acatcaaacg 1980 aaacatcagc actggcgcca aatccgtgca acaggaaggc tgcaaatgtt ctctcttcct 2040 tctttctctc ttgaagcatg gcatcaagtt ctgagataga atgtccgtgg aaggtaagtg 2100 tgtcatattg ataattgagt ctctcataat caaatgaatc gactgcccta gaattggctc 2160 tggtgactgg gtttggattc ctggactcgc tgaatggttt aagaggttgt ttcagaacct 2220 ggatatcgca atgagctgcg ttgtattgaa aacctctgta tttctggagt gcctgccaga 2280 tggcccagat acgatcagtg ttggaatggt ggagataaaa gagtggatca tatgctgtat 2340 attccagtga tgacattcca tatggagtat gtcctccagt ccaagagtgg attgcattgt 2400 gtgttatttc aaactgaact tcaaattggc agaagtctgt ctgttctaga gccaagagaa 2460 cctgtctata gaagaatgat tctgacccgt gttcagggtc attgaacagc atgtctctcg 2520 gagatcgaga tgtatccaca ttccgatgag caatatgtcc gtggtggaag ggattgtgct 2580 cgtggtcagt cactagggca ggcagatgac taaacgcact tgtccaatcc cagtatggta 2640 tgccaatgtg agctccatgc gcagccagtg catcctccat ctgtttcaca aacagtctgt 2700 gccagtgagg gaaagatgcc attccatgag tacaacatgc aacgtctcta ccatcaggca 2760 tgtgacacat gggaggactt ccgtgataag cagcaattgc ctggaatcca ttgggtccat 2820 catcatccat cacgctttgc agagcatgcc tcaggttctc tatctcagat gcggtaagag 2880 acgtcacgtc tttcctgact ccactgccag caatatgatc tgagccgtga tcatcggtat 2940 gaccttcacc agctgagtag attatagtag gtgctgggat agctgaagat gggagagcag 3000 tcttgtttac accatggatc tccactttaa tgaagaaagg agcatcagtg tcaaacacat 3060 cttcagggtg tatggactga gccttcaaag cttgggtgat gtcgtattta aagttcctgt 3120 cataggccca gtgcatctct ttagaacctc ctaagataaa gatcatgcca gatgcgtggc 3180 agtcgtcagc tgtcttgcaa atccaaaatc gtacatcagc tgatcttcca atgccatgta 3240 gaaggaaccc ggcaaagact cttgtttggc tttttcgccg gttgatttcc gcttccaact 3300 gattaacatt catgccagca atttctaagt tgtcatacgt gtagaaaagg tcctgaaagt 3360 caaatagagt gttggggact gcgtgcattt ttgtgaatcc attgtgattg aaatcgtgtg 3420 caaatggtcg cattggtttg gtcatcatgt taagagcaca atccgcttta tgatagggac 3480 gctttctctt cttttgcaga gcttgccaga ctgcccatat tttgtcaaca aaggagtgat 3540 gaataaagaa ggctgggtca tatgaagcat agtgtagaga agaaagagca taaacttgtc 3600 gaccacccac caagtagtga atagcattat ggacgacctc aaattgaacc tcaaaatcac 3660 agaaatctgt ctgttctaag gctaaaagag tttgatctaa caaaacagat gttcccgttt 3720 cgtcctgcaa gtggtacaaa atgtcctgag gatccctaac cgtgtaagcg tcctcggatt 3780 taatgtagcc tttagcaaat ggattattga caactgcatc cctccaagga tcgtaatatt 3840 cagggctggt gaagagatca ggcagtttgc tgataggttt ggtccagtcc cagtatggaa 3900 tagcaacagc agatccgtga gataacagag ccatgtccat ttgcagggtg taaagcctgt 3960 gccagtgtgg gaatactggc attccgtgga tacaacaggc aatctcatgt ccctcatcat 4020 catgacaaga agccgggaga ccgtggaatg atgcaatggc ttgataaccg ttcaccccat 4080 cgtcagcttg tagagacctc attgcagatt tcaggctggc gagatctctc tcggtgagtt 4140 cagatagttc catacgaatc cgattacgac caacgtgacg tacattgatc ccatgctttc 4200 cttcttcaaa gagaacagaa ggagttggaa tcagatcgct gtccaactca gctccagcta 4260 tatcctttat gtgaaccttg atggtaaatg cgtcatcgta tcgcagtttc atgtctttca 4320 acgtttttgt aatgtcgtac ttgtaaagcc tgtcaaactg ccacggcatt tctgaacttc 4380 caccgagaac tgagaagtat ccagcttctt ggcaactctg gtctggtctg cagatatcaa 4440 acgatacgga tgctgattgt ccaaatcctt taagaaggaa tcctgcgaag actctatcag 4500 cttcctgttg ttgttttata tagttatgta aacttgaaac agacatgcca acaaattcga 4560 gggaatcata ctcatacccg aatcggtgat aatcaaatgt gtcttcaggc ttggagaact 4620 cttgagtgcg tttgttcaaa ttatagggac ttccaaagct gaaaggtttg agagggtctt 4680 tcatttgctc cagggcgcag tgtgcttcct tccctgaaag acctctgtgc tcctggaaag 4740 cttgccagat agcccaaatg cgatctgtct gcgaatggtg gatatagaac agtggatcgt 4800 atgatgcgta atgaagatga cctattgaat gggtctttga tcctccaacc catgaatgaa 4860 ttccattatg caaaatttcg aactgtattt caaagtcaca gtaattttct tgttctagtg 4920 catatagtgc ctgattgtaa aaccagttgt tgtaaaccag atttcctttt gcaaagaggt 4980 gttcagcatc aacatccctc tctgttgtta ctccctctcc ctcaaattct atatttgcct 5040 tgaggaacgg atttgggaag gttttgtgtg aaaacaggtg ctcgtatgtc tcctgtgtca 5100 ccaaatcagg gagtgcagac tgcggtttca gccaatccca atatggaagt ccgacaatag 5160 aaccatgtcg tttcaaagaa tcttcgaact gaactgtgta gagtctgtgc cactgaggga 5220 aggtagccat accatggagg caacaagcgt gtctatgtgc agcagatgga cttggacaaa 5280 gaggaggcaa agcgtggaaa gaagcaatag cttgatatcc gtctggagaa tcatctgctt 5340 gcatgttctt gagagctttc ctcaaaaaat gtttctccag tgttgtaaga gagcttactt 5400 cttttcgcac aagattatgt tgtgaaagcc catctgcagt ttgatgtgaa tgttcaccag 5460 gaacaaaaag cagactcggt tcgttgagga tgtgtggatc aagatgcgat ccattgacag 5520 ctctgacgtc aactttgatg tcgaatccat gtccgtccaa gggaacacca agtttatgca 5580 gagcagacgt tatgtcaaat ttgtacaagc ggtcaaagtg gaatggcatt tctgtttctc 5640 cccccagaat cgtgaaggaa cccgcctttg tcttgcagtt ttgttctcct ccttgttcaa 5700 cgcaaatata aatatctaca acaacagatg tcccaatgtt gtgaagaata aagccagcaa 5760 aaactctgtc gtgttgtttt ctagcctgaa tagtttggtc tagcttttgt atgctgaagt 5820 ggttaaattg aaggttatca tattggtacc tgaagcggtt ttgataatca aaactgtctt 5880 gtgggaggtt gtgtttaagc gtcatggaat cactgttaag ttctgggtta ttaaatggtt 5940 gaagaggttt acgcatctcg ttgactgcgc agtcagcctc attgtatggt ttctttctat 6000 atctctgcaa gtcctgccag atggcccaga ttctgtcaac gtttgcatgg tgaaggaaaa 6060 atacaggatc aaatgcggta taatcaagag acgaaaggga gtatttagcc cttcctccaa 6120 gtctagaatg aagggcgttg tgtatcagct caagctgaat ctcaaagtca cagaagttgt 6180 cttgctcaag agccagcatg acattctcgt agaagtcctt attgtcccat agttctggct 6240 gaggatctct ggacgtcaca gcattctcga aggcgatatg tcccctgaag aaaggattag 6300 gatcaaaggt ctgggatcgt gaattgaaat aagttgcatc attgattaat gatggcagag 6360 agtcagcttt ctcggtccag tcccagtaag gaacagcaac tgcagaccct cgttctaaga 6420 gcgcattctc cacctgaaga acgtacagtc tgtgccagtg gggaaaggtg ggcatgccat 6480 ggacacaaca agcaacagga tgtccatcat attcacaaag tcctggtttt ccatggaact 6540 ttgcaatgtt ttcatatata ccctcttttt gaatttggag gaaagcagat ctgatgctct 6600 caatttctcc ctctgagagg tcccggatat tttttcttat gtggctagcg cttgttactg 6660 cctcacgata ttcatcagcc tggtgtgaac ctgcagttgg ctcgaaaatc accgaaggct 6720 gagaaaagat acttccacca agacttccgc cattcagatc aaaggcttca tatttaagaa 6780 agaaattatc tccaacatgt agatctaaat cgtgcagctg ctgtgttatc tcgtacttgt 6840 acacacggtc atacctccaa ggcatctcag cttcatctcc caaaatgtag aattctccag 6900 cataatggtc acagtcagat acattgtgtt tgcaaacgta gaatttcacg agtgcagact 6960 gtccaatttc atgaagaagg aatcctgcaa agactctgtc atgtgacttt cttctctgca 7020 actctcggtc cagttgggca atagacagac cattaaatgc caggctctca tactcatagt 7080 cgaagttgtt cttgtagtcg aagactcgga aaggaaccga atgttcacga gtttcgtcat 7140 ctggatttat gacactatca agaccaaatg gctgaagtgg ttttctcatg gatgcaatgg 7200 cacagtttgc agtgttgtat ggtttccccc ggtatttttg caaagcttgc caaatggccc 7260 aaagtcggtc tgtattggag tggtgcaaga agaagattgg atcgtatgta gtatacctca 7320 aagatgacat actgtatggt tcgttaccac caactagagc atgtatgaaa ttatgggcaa 7380 tttcaaattg tacctcaaaa tcacagaaat cgtcctgctc aaaggccaga aggacttgtt 7440 ttgcaatatc tgtgtagtga ccgaaaccag gagattgata aagatcatca cgcacagctc 7500 ttgtagtatc atgcccaaca gaatctatgt gaccatgata ccatgggtta tcttcttcga 7560 ggctggaaac agaatcattg taagtaggat cagcaacaaa atgtggaagg gcgctcattg 7620 gtcgagtcca gtcccaatag ggcaatccac cagtaaatcc atgtttcctc agagcatttt 7680 ctccttgaac tgtcagcaat ctgtgccagt gagggaaaac agccattccg tgaacgcagc 7740 atgcaaattt cttctccgct tcagggcgtg ggcaccattt tggttctccg tgaaatgctg 7800 caatctgttg gaatccacct cgtgttttgt catcttggag atctgccata gctcgcctca 7860 acgaattttg ttcttcagct gtcagggaat gaacgttctt cctgattttt ttgctgtggc 7920 caaagtcttt ggcaacaata tgcccaggtt caaacatgag agcaggagac ggaattagct 7980 ctggtgacaa ggaagttcca tcaacatcag tgatcttaac agaaatagtg gcatcattca 8040 catcaatgtt cagctttttg gcagcatcag taatttcata tttgtacatt ctgtcgaaag 8100 cccatttcat ttccttagat ccaccaagca ctgcaaagct tccaatcgtg tgggcctctt 8160 ggccctttgc actgataagg acgtctacgt ttgctgatgt tccaatatga gaaaggaaga 8220 atcctgcaaa acttctgtca tgctgtttct gcttgtgtat atatttttca agttctgaca 8280 gggtcatgcc accaaactgg aggtcatcat atcgatagtc caaaactccc tcatagtcgt 8340 aaacattagt tggaacagag ttttggtagg ttttatcgtt gttgttgaga ggtgaactga 8400 atgaaaaagg ctttaatggt tccctttcct gtgaccaggc acaatgggcc ttgtaaggtt 8460 tgttcctcct gatttgaaga gcttgccaaa tggcccagag acgatcaaca tttgagtgat 8520 ggaggtagaa aataggatca aaagcagtgt agtgaagaga agacatggag tacttttcac 8580 ttcctccaac ccacgcatgt atgttattgt gaacgagttc aaactgaact tcaaagtcgc 8640 aaaagtcttc ttgttcaaaa gcatagacca tagcgtcaaa cagatgcgtg tggtgtccaa 8700 aagacggttt ctcaaagagt ctattgtctg tatctctgtg tgtgacgtta ttttcaaaag 8760 ccacttcggc atggtaaaat ggatttgcca tgattgcgct agaaactgga tggggatagg 8820 tctcactcat agccagttct ggcagatgat ccattggcat ggtccagtcc cagtaaggta 8880 gaccgatagt agacccgcga cgaagtagag catcttcaac ttgagtgaca aagagtctat 8940 gccagtgggg gaaggttgcc atgccatgaa cgcagcatgc aaatctatct ttagcatctg 9000 ggcgtggaca tcttgctgga agggcatgga attctactgt agcttggtac ccgtcaactg 9060 agtcattctg gaatctttcc atagctttcc tcaagctgta tgtttcctct gcagttagat 9120 cgttgatatt ctttctgaca gccaggttag cttgagcaac tggaactgga ggcttatcaa 9180 aatgaccttt agctggtcgg tgcactaagg tgggttgtcg aagaatgtca tttgacaaag 9240 cgcttccgtt gacgctgaat aactcggact tcacgtaata atgcccatga ccatcatcat 9300 ctatagaagc aacgctgtca gtaacatcaa agtggtatgc cctgtagaat ccccacttca 9360 tttcagttgg tccgccaaga aggaagaaat ctcctgcttt gatacattga tcatcagcga 9420 tttcatcatg agcaggtcgg cagacataga ctacaacatt ggcagagcca ccaaaaccag 9480 ataatatgaa gctggcaaat gtacgctcct tagctgctcg ttctcctaga tatacgttga 9540 gttcctctgg actcatgcca ttcaaggaga ggtcgtcata ctggtagccg aggcgcccgt 9600 agtcaaagct atcagcaggt ttggcgtgct ctcttgtcag atccagtgga ttagtgtctc 9660 tgtcgaacgg ttccattgga gtgtgaataa ggttatgggc acagtcagca ctgttgggat 9720 ccttgcctct gagctgttga agtcgttgcc atatagcaaa gatacggtca gtgttgaagt 9780 gatgtaaaaa gaacaaagga tcgtatgacg tgtactcgag gtgagacatg gaatatgtgt 9840 gtctgccccc aactagataa tgtatggcgt tgtgagccag ctcaaactgg atctcaaatt 9900 tgcaaaactc gtcctgttcc agagcgtcaa ggattccctc catgaggcga gtgtggtgtc 9960 ctggctgtac tttttcaaaa agacggtcat ctacagcacg acttgtcttc ttgttgagga 10020 aagtgatgtc agctcggtac caagaattgt catgagcctt gcccccctca gtgtcgacaa 10080 acagtggatg agacacaagt tctggaagat gggtgagctc actagtccaa tcccagtagg 10140 ggatgctgat ggttgccttc ttccttatca gcgctctttc caactggaca acgtagagac 10200 gatgccaaag gggaaaggat ggcataccgt gaagacaaca ggccacattt ttttcatgtt 10260 tgtccacaca cattgttgga acaccatggt agcctgctat ggcctgataa ccctggttgc 10320 tgtcgtcttg ctggagatca tcgagagcct tctcgagggc tagaacctcg tcactgctca 10380 aactgtccac attctttctc accactgtgt ctaccgagaa ggctccctca aacaagaggg 10440 tcgccgtgag gagagccaag atggtccaca tgatcaaacc agatggctgg tgatc 10495 12 3420 PRT Megathura crenulata SEQ ID NO12 is the deduced amino acid sequence of the coding region of SEQ ID NO10 12 Met Trp Thr Ile Leu Ala Leu Leu Thr Ala Thr Leu Leu Phe Glu Gly 1 5 10 15 Ala Phe Ser Val Asp Thr Val Val Arg Lys Asn Val Asp Ser Leu Ser 20 25 30 Ser Asp Glu Val Leu Ala Leu Glu Lys Ala Leu Asp Asp Leu Gln Gln 35 40 45 Asp Asp Ser Asn Gln Gly Tyr Gln Ala Ile Ala Gly Tyr His Gly Val 50 55 60 Pro Thr Met Cys Val Asp Lys His Glu Lys Asn Val Ala Cys Cys Leu 65 70 75 80 His Gly Met Pro Ser Phe Pro Leu Trp His Arg Leu Tyr Val Val Gln 85 90 95 Leu Glu Arg Ala Leu Ile Arg Lys Lys Ala Thr Ile Ser Ile Pro Tyr 100 105 110 Trp Asp Trp Thr Ser Glu Leu Thr His Leu Pro Glu Leu Val Ser His 115 120 125 Pro Leu Phe Val Asp Thr Glu Gly Gly Lys Ala His Asp Asn Ser Trp 130 135 140 Tyr Arg Ala Asp Ile Thr Phe Leu Asn Lys Lys Thr Ser Arg Ala Val 145 150 155 160 Asp Asp Arg Leu Phe Glu Lys Val Gln Pro Gly His His Thr Arg Leu 165 170 175 Met Glu Gly Ile Leu Asp Ala Leu Glu Gln Asp Glu Phe Cys Lys Phe 180 185 190 Glu Ile Gln Phe Glu Leu Ala His Asn Ala Ile His Tyr Leu Val Gly 195 200 205 Gly Arg His Thr Tyr Ser Met Ser His Leu Glu Tyr Thr Ser Tyr Asp 210 215 220 Pro Leu Phe Phe Leu His His Phe Asn Thr Asp Arg Ile Phe Ala Ile 225 230 235 240 Trp Gln Arg Leu Gln Gln Leu Arg Gly Lys Asp Pro Asn Ser Ala Asp 245 250 255 Cys Ala His Asn Leu Ile His Thr Pro Met Glu Pro Phe Asp Arg Asp 260 265 270 Thr Asn Pro Leu Asp Leu Thr Arg Glu His Ala Lys Pro Ala Asp Ser 275 280 285 Phe Asp Tyr Gly Arg Leu Gly Tyr Gln Tyr Asp Asp Leu Ser Leu Asn 290 295 300 Gly Met Ser Pro Glu Glu Leu Asn Val Tyr Leu Gly Glu Arg Ala Ala 305 310 315 320 Lys Glu Arg Thr Phe Ala Ser Phe Ile Leu Ser Gly Phe Gly Gly Ser 325 330 335 Ala Asn Val Val Val Tyr Val Cys Arg Pro Ala His Asp Glu Ile Ala 340 345 350 Asp Asp Gln Cys Ile Lys Ala Gly Asp Phe Phe Leu Leu Gly Gly Pro 355 360 365 Thr Glu Met Lys Trp Gly Phe Tyr Arg Ala Tyr His Phe Asp Val Thr 370 375 380 Asp Ser Val Ala Ser Ile Asp Asp Asp Gly His Gly His Tyr Tyr Val 385 390 395 400 Lys Ser Glu Leu Phe Ser Val Asn Gly Ser Ala Leu Ser Asn Asp Ile 405 410 415 Leu Arg Gln Pro Thr Leu Val His Arg Pro Ala Lys Gly His Phe Asp 420 425 430 Lys Pro Pro Val Pro Val Ala Gln Ala Asn Leu Ala Val Arg Lys Asn 435 440 445 Ile Asn Asp Leu Thr Ala Glu Glu Thr Tyr Ser Leu Arg Lys Ala Met 450 455 460 Glu Arg Phe Gln Asn Asp Ser Val Asp Gly Tyr Gln Ala Thr Val Glu 465 470 475 480 Phe His Ala Leu Pro Ala Arg Cys Pro Arg Pro Asp Ala Lys Asp Arg 485 490 495 Phe Ala Cys Cys Val His Gly Met Ala Thr Phe Pro His Trp His Arg 500 505 510 Leu Phe Val Thr Gln Val Glu Asp Ala Leu Leu Arg Arg Gly Ser Thr 515 520 525 Ile Gly Leu Pro Tyr Trp Asp Trp Thr Met Pro Met Asp His Leu Pro 530 535 540 Glu Leu Ala Met Ser Glu Thr Tyr Pro His Pro Val Ser Ser Ala Ile 545 550 555 560 Met Ala Asn Pro Phe Tyr His Ala Glu Val Ala Phe Glu Asn Asn Val 565 570 575 Thr His Arg Asp Thr Asp Asn Arg Leu Phe Glu Lys Pro Ser Phe Gly 580 585 590 His His Thr His Leu Phe Asp Ala Met Val Tyr Ala Phe Glu Gln Glu 595 600 605 Asp Phe Cys Asp Phe Glu Val Gln Phe Glu Leu Val His Asn Asn Ile 610 615 620 His Ala Trp Val Gly Gly Ser Glu Lys Tyr Ser Met Ser Ser Leu His 625 630 635 640 Tyr Thr Ala Phe Asp Pro Ile Phe Tyr Leu His His Ser Asn Val Asp 645 650 655 Arg Leu Trp Ala Ile Trp Gln Ala Leu Gln Ile Arg Arg Asn Lys Pro 660 665 670 Tyr Lys Ala His Cys Ala Trp Ser Gln Glu Arg Glu Pro Leu Lys Pro 675 680 685 Phe Ser Phe Ser Ser Pro Leu Asn Asn Asn Asp Lys Thr Tyr Gln Asn 690 695 700 Ser Val Pro Thr Asn Val Tyr Asp Tyr Glu Gly Val Leu Asp Tyr Arg 705 710 715 720 Tyr Asp Asp Leu Gln Phe Gly Gly Met Thr Leu Ser Glu Leu Glu Lys 725 730 735 Tyr Ile His Lys Gln Lys Gln His Asp Arg Ser Phe Ala Gly Phe Phe 740 745 750 Leu Ser His Ile Gly Thr Ser Ala Asn Val Asp Val Leu Ile Ser Ala 755 760 765 Lys Gly Gln Glu Ala His Thr Ile Gly Ser Phe Ala Val Leu Gly Gly 770 775 780 Ser Lys Glu Met Lys Trp Ala Phe Asp Arg Met Tyr Lys Tyr Glu Ile 785 790 795 800 Thr Asp Ala Ala Lys Lys Leu Asn Ile Asp Val Asn Asp Ala Thr Ile 805 810 815 Ser Val Lys Ile Thr Asp Val Asp Gly Thr Ser Leu Ser Pro Glu Leu 820 825 830 Ile Pro Ser Pro Ala Leu Met Phe Glu Pro Gly His Ile Val Ala Lys 835 840 845 Asp Phe Gly His Ser Lys Lys Ile Arg Lys Asn Val His Ser Leu Thr 850 855 860 Ala Glu Glu Gln Asn Ser Leu Arg Arg Ala Met Ala Asp Leu Gln Asp 865 870 875 880 Asp Lys Thr Arg Gly Gly Phe Gln Gln Ile Ala Ala Phe His Gly Glu 885 890 895 Pro Lys Trp Cys Pro Arg Pro Glu Ala Glu Lys Lys Phe Ala Cys Cys 900 905 910 Val His Gly Met Ala Val Phe Pro His Trp His Arg Leu Leu Thr Val 915 920 925 Gln Gly Glu Asn Ala Leu Arg Lys His Gly Phe Thr Gly Gly Leu Pro 930 935 940 Tyr Trp Asp Trp Thr Arg Pro Met Ser Ala Leu Pro His Phe Val Ala 945 950 955 960 Asp Pro Thr Tyr Asn Asp Ser Val Ser Ser Leu Glu Glu Asp Asn Pro 965 970 975 Trp Tyr His Gly His Ile Asp Ser Val Gly His Asp Thr Thr Arg Ala 980 985 990 Val Arg Asp Asp Leu Tyr Gln Ser Pro Gly Phe Gly His Tyr Thr Asp 995 1000 1005 Ile Ala Lys Gln Val Leu Leu Ala Phe Glu Gln Asp Asp Phe Cys 1010 1015 1020 Asp Phe Glu Val Gln Phe Glu Ile Ala His Asn Phe Ile His Ala 1025 1030 1035 Leu Val Gly Gly Asn Glu Pro Tyr Ser Met Ser Ser Leu Arg Tyr 1040 1045 1050 Thr Thr Tyr Asp Pro Ile Phe Phe Leu His His Ser Asn Thr Asp 1055 1060 1065 Arg Leu Trp Ala Ile Trp Gln Ala Leu Gln Lys Tyr Arg Gly Lys 1070 1075 1080 Pro Tyr Asn Thr Ala Asn Cys Ala Ile Ala Ser Met Arg Lys Pro 1085 1090 1095 Leu Gln Pro Phe Gly Leu Asp Ser Val Ile Asn Pro Asp Asp Glu 1100 1105 1110 Thr Arg Glu His Ser Val Pro Phe Arg Val Phe Asp Tyr Lys Asn 1115 1120 1125 Asn Phe Asp Tyr Glu Tyr Glu Ser Leu Ala Phe Asn Gly Leu Ser 1130 1135 1140 Ile Ala Gln Leu Asp Arg Glu Leu Gln Arg Arg Lys Ser His Asp 1145 1150 1155 Arg Val Phe Ala Gly Phe Leu Leu His Glu Ile Gly Gln Ser Ala 1160 1165 1170 Leu Val Lys Phe Tyr Val Cys Lys His Asn Val Ser Asp Cys Asp 1175 1180 1185 His Tyr Ala Gly Glu Phe Tyr Ile Leu Gly Asp Glu Ala Glu Met 1190 1195 1200 Pro Trp Arg Tyr Asp Arg Val Tyr Lys Tyr Glu Ile Thr Gln Gln 1205 1210 1215 Leu His Asp Leu Asp Leu His Val Gly Asp Asn Phe Phe Leu Lys 1220 1225 1230 Tyr Glu Ala Phe Asp Leu Asn Gly Gly Ser Leu Gly Gly Ser Ile 1235 1240 1245 Phe Ser Gln Pro Ser Val Ile Phe Glu Pro Thr Ala Gly Ser His 1250 1255 1260 Gln Ala Asp Glu Tyr Arg Glu Ala Val Thr Ser Ala Ser His Ile 1265 1270 1275 Arg Lys Asn Ile Arg Asp Leu Ser Glu Gly Glu Ile Glu Ser Ile 1280 1285 1290 Arg Ser Ala Phe Leu Gln Ile Gln Lys Glu Gly Ile Tyr Glu Asn 1295 1300 1305 Ile Ala Lys Phe His Gly Lys Pro Gly Leu Cys Glu Tyr Asp Gly 1310 1315 1320 His Pro Val Ala Cys Cys Val His Gly Met Pro Thr Phe Pro His 1325 1330 1335 Trp His Arg Leu Tyr Val Leu Gln Val Glu Asn Ala Leu Leu Glu 1340 1345 1350 Arg Gly Ser Ala Val Ala Val Pro Tyr Trp Asp Trp Thr Glu Lys 1355 1360 1365 Ala Asp Ser Leu Pro Ser Leu Ile Asn Asp Ala Thr Tyr Phe Asn 1370 1375 1380 Ser Arg Ser Gln Thr Phe Asp Pro Asn Pro Phe Phe Arg Gly His 1385 1390 1395 Ile Ala Phe Glu Asn Ala Val Thr Ser Arg Asp Pro Gln Pro Glu 1400 1405 1410 Leu Trp Asp Asn Lys Asp Phe Tyr Glu Asn Val Met Leu Ala Leu 1415 1420 1425 Glu Gln Asp Asn Phe Cys Asp Phe Glu Ile Gln Leu Glu Leu Ile 1430 1435 1440 His Asn Ala Leu His Ser Arg Leu Gly Gly Arg Ala Lys Tyr Ser 1445 1450 1455 Leu Ser Ser Leu Asp Tyr Thr Ala Phe Asp Pro Val Phe Phe Leu 1460 1465 1470 His His Ala Asn Val Asp Arg Ile Trp Ala Ile Trp Gln Asp Leu 1475 1480 1485 Gln Arg Tyr Arg Lys Lys Pro Tyr Asn Glu Ala Asp Cys Ala Val 1490 1495 1500 Asn Glu Met Arg Lys Pro Leu Gln Pro Phe Asn Asn Pro Glu Leu 1505 1510 1515 Asn Ser Asp Ser Met Thr Leu Lys His Asn Leu Pro Gln Asp Ser 1520 1525 1530 Phe Asp Tyr Gln Asn Arg Phe Arg Tyr Gln Tyr Asp Asn Leu Gln 1535 1540 1545 Phe Asn His Phe Ser Ile Gln Lys Leu Asp Gln Thr Ile Gln Ala 1550 1555 1560 Arg Lys Gln His Asp Arg Val Phe Ala Gly Phe Ile Leu His Asn 1565 1570 1575 Ile Gly Thr Ser Val Val Val Asp Ile Tyr Ile Cys Val Glu Gln 1580 1585 1590 Gly Gly Glu Gln Asn Cys Lys Thr Lys Ala Gly Ser Phe Thr Ile 1595 1600 1605 Leu Gly Gly Glu Thr Glu Met Pro Phe His Phe Asp Arg Leu Tyr 1610 1615 1620 Lys Phe Asp Ile Thr Ser Ala Leu His Lys Leu Gly Val Pro Leu 1625 1630 1635 Asp Gly His Gly Phe Asp Ile Lys Val Asp Val Arg Ala Val Asn 1640 1645 1650 Gly Ser His Leu Asp Pro His Ile Leu Asn Glu Pro Ser Leu Leu 1655 1660 1665 Phe Val Pro Gly Glu His Ser His Gln Thr Ala Asp Gly Leu Ser 1670 1675 1680 Gln His Asn Leu Val Arg Lys Glu Val Ser Ser Leu Thr Thr Leu 1685 1690 1695 Glu Lys His Phe Leu Arg Lys Ala Leu Lys Asn Met Gln Ala Asp 1700 1705 1710 Asp Ser Pro Asp Gly Tyr Gln Ala Ile Ala Ser Phe His Ala Leu 1715 1720 1725 Pro Pro Leu Cys Pro Ser Pro Ser Ala Ala His Arg His Ala Cys 1730 1735 1740 Cys Leu His Gly Met Ala Thr Phe Pro Gln Trp His Arg Leu Tyr 1745 1750 1755 Thr Val Gln Phe Glu Asp Ser Leu Lys Arg His Gly Ser Ile Val 1760 1765 1770 Gly Leu Pro Tyr Trp Asp Trp Leu Lys Pro Gln Ser Ala Leu Pro 1775 1780 1785 Asp Leu Val Thr Gln Glu Thr Tyr Glu His Leu Phe Ser His Lys 1790 1795 1800 Thr Phe Pro Asn Pro Phe Leu Lys Ala Asn Ile Glu Phe Glu Gly 1805 1810 1815 Glu Gly Val Thr Thr Glu Arg Asp Val Asp Ala Glu His Leu Phe 1820 1825 1830 Ala Lys Gly Asn Leu Val Tyr Asn Asn Trp Phe Tyr Asn Gln Ala 1835 1840 1845 Leu Tyr Ala Leu Glu Gln Glu Asn Tyr Cys Asp Phe Glu Ile Gln 1850 1855 1860 Phe Glu Ile Leu His Asn Gly Ile His Ser Trp Val Gly Gly Ser 1865 1870 1875 Lys Thr His Ser Ile Gly His Leu His Tyr Ala Ser Tyr Asp Pro 1880 1885 1890 Leu Phe Tyr Ile His His Ser Gln Thr Asp Arg Ile Trp Ala Ile 1895 1900 1905 Trp Gln Ala Phe Gln Glu His Arg Gly Leu Ser Gly Lys Glu Ala 1910 1915 1920 His Cys Ala Leu Glu Gln Met Lys Asp Pro Leu Lys Pro Phe Ser 1925 1930 1935 Phe Gly Ser Pro Tyr Asn Leu Asn Lys Arg Thr Gln Glu Phe Ser 1940 1945 1950 Lys Pro Glu Asp Thr Phe Asp Tyr His Arg Phe Gly Tyr Glu Tyr 1955 1960 1965 Asp Ser Leu Glu Phe Val Gly Met Ser Val Ser Ser Leu His Asn 1970 1975 1980 Tyr Ile Lys Gln Gln Gln Glu Ala Asp Arg Val Phe Ala Gly Phe 1985 1990 1995 Leu Leu Lys Gly Phe Gly Gln Ser Ala Ser Val Ser Phe Asp Ile 2000 2005 2010 Cys Arg Pro Asp Gln Ser Cys Gln Glu Ala Gly Tyr Phe Ser Val 2015 2020 2025 Leu Gly Gly Ser Ser Glu Met Pro Trp Gln Phe Asp Arg Leu Tyr 2030 2035 2040 Lys Tyr Asp Ile Thr Lys Thr Leu Lys Asp Met Lys Leu Arg Tyr 2045 2050 2055 Asp Asp Ala Phe Thr Ile Lys Val His Ile Lys Asp Ile Ala Gly 2060 2065 2070 Ala Glu Leu Asp Ser Asp Leu Ile Pro Thr Pro Ser Val Leu Phe 2075 2080 2085 Glu Glu Gly Lys His Gly Ile Asn Val Arg His Val Gly Arg Asn 2090 2095 2100 Arg Ile Arg Met Glu Leu Ser Glu Leu Thr Glu Arg Asp Leu Ala 2105 2110 2115 Ser Leu Lys Ser Ala Met Arg Ser Leu Gln Ala Asp Asp Gly Val 2120 2125 2130 Asn Gly Tyr Gln Ala Ile Ala Ser Phe His Gly Leu Pro Ala Ser 2135 2140 2145 Cys His Asp Asp Glu Gly His Glu Ile Ala Cys Cys Ile His Gly 2150 2155 2160 Met Pro Val Phe Pro His Trp His Arg Leu Tyr Thr Leu Gln Met 2165 2170 2175 Asp Met Ala Leu Leu Ser His Gly Ser Ala Val Ala Ile Pro Tyr 2180 2185 2190 Trp Asp Trp Thr Lys Pro Ile Ser Lys Leu Pro Asp Leu Phe Thr 2195 2200 2205 Ser Pro Glu Tyr Tyr Asp Pro Trp Arg Asp Ala Val Val Asn Asn 2210 2215 2220 Pro Phe Ala Lys Gly Tyr Ile Lys Ser Glu Asp Ala Tyr Thr Val 2225 2230 2235 Arg Asp Pro Gln Asp Ile Leu Tyr His Leu Gln Asp Glu Thr Gly 2240 2245 2250 Thr Ser Val Leu Leu Asp Gln Thr Leu Leu Ala Leu Glu Gln Thr 2255 2260 2265 Asp Phe Cys Asp Phe Glu Val Gln Phe Glu Val Val His Asn Ala 2270 2275 2280 Ile His Tyr Leu Val Gly Gly Arg Gln Val Tyr Ala Leu Ser Ser 2285 2290 2295 Leu His Tyr Ala Ser Tyr Asp Pro Ala Phe Phe Ile His His Ser 2300 2305 2310 Phe Val Asp Lys Ile Trp Ala Val Trp Gln Ala Leu Gln Lys Lys 2315 2320 2325 Arg Lys Arg Pro Tyr His Lys Ala Asp Cys Ala Leu Asn Met Met 2330 2335 2340 Thr Lys Pro Met Arg Pro Phe Ala His Asp Phe Asn His Asn Gly 2345 2350 2355 Phe Thr Lys Met His Ala Val Pro Asn Thr Leu Phe Asp Phe Gln 2360 2365 2370 Asp Leu Phe Tyr Thr Tyr Asp Asn Leu Glu Ile Ala Gly Met Asn 2375 2380 2385 Val Asn Gln Leu Glu Ala Glu Ile Asn Arg Arg Lys Ser Gln Thr 2390 2395 2400 Arg Val Phe Ala Gly Phe Leu Leu His Gly Ile Gly Arg Ser Ala 2405 2410 2415 Asp Val Arg Phe Trp Ile Cys Lys Thr Ala Asp Asp Cys His Ala 2420 2425 2430 Ser Gly Met Ile Phe Ile Leu Gly Gly Ser Lys Glu Met His Trp 2435 2440 2445 Ala Tyr Asp Arg Asn Phe Lys Tyr Asp Ile Thr Gln Ala Leu Lys 2450 2455 2460 Ala Gln Ser Ile His Pro Glu Asp Val Phe Asp Thr Asp Ala Pro 2465 2470 2475 Phe Phe Ile Lys Val Glu Ile His Gly Val Asn Lys Thr Ala Leu 2480 2485 2490 Pro Ser Ser Ala Ile Pro Ala Pro Thr Ile Ile Tyr Ser Ala Gly 2495 2500 2505 Glu Gly His Thr Asp Asp His Gly Ser Asp His Ile Ala Gly Ser 2510 2515 2520 Gly Val Arg Lys Asp Val Thr Ser Leu Thr Ala Ser Glu Ile Glu 2525 2530 2535 Asn Leu Arg His Ala Leu Gln Ser Val Met Asp Asp Asp Gly Pro 2540 2545 2550 Asn Gly Phe Gln Ala Ile Ala Ala Tyr His Gly Ser Pro Pro Met 2555 2560 2565 Cys His Met Pro Asp Gly Arg Asp Val Ala Cys Cys Thr His Gly 2570 2575 2580 Met Ala Ser Phe Pro His Trp His Arg Leu Phe Val Lys Gln Met 2585 2590 2595 Glu Asp Ala Leu Ala Ala His Gly Ala His Ile Gly Ile Pro Tyr 2600 2605 2610 Trp Asp Trp Thr Ser Ala Phe Ser His Leu Pro Ala Leu Val Thr 2615 2620 2625 Asp His Glu His Asn Pro Phe His His Gly His Ile Ala His Arg 2630 2635 2640 Asn Val Asp Thr Ser Arg Ser Pro Arg Asp Met Leu Phe Asn Asp 2645 2650 2655 Pro Glu His Gly Ser Glu Ser Phe Phe Tyr Arg Gln Val Leu Leu 2660 2665 2670 Ala Leu Glu Gln Thr Asp Phe Cys Gln Phe Glu Val Gln Phe Glu 2675 2680 2685 Ile Thr His Asn Ala Ile His Ser Trp Thr Gly Gly His Thr Pro 2690 2695 2700 Tyr Gly Met Ser Ser Leu Glu Tyr Thr Ala Tyr Asp Pro Leu Phe 2705 2710 2715 Tyr Leu His His Ser Asn Thr Asp Arg Ile Trp Ala Ile Trp Gln 2720 2725 2730 Ala Leu Gln Lys Tyr Arg Gly Phe Gln Tyr Asn Ala Ala His Cys 2735 2740 2745 Asp Ile Gln Val Leu Lys Gln Pro Leu Lys Pro Phe Ser Glu Ser 2750 2755 2760 Arg Asn Pro Asn Pro Val Thr Arg Ala Asn Ser Arg Ala Val Asp 2765 2770 2775 Ser Phe Asp Tyr Glu Arg Leu Asn Tyr Gln Tyr Asp Thr Leu Thr 2780 2785 2790 Phe His Gly His Ser Ile Ser Glu Leu Asp Ala Met Leu Gln Glu 2795 2800 2805 Arg Lys Lys Glu Glu Arg Thr Phe Ala Ala Phe Leu Leu His Gly 2810 2815 2820 Phe Gly Ala Ser Ala Asp Val Ser Phe Asp Val Cys Thr Pro Asp 2825 2830 2835 Gly His Cys Ala Phe Ala Gly Thr Phe Ala Val Leu Gly Gly Glu 2840 2845 2850 Leu Glu Met Pro Trp Ser Phe Glu Arg Leu Phe Arg Tyr Asp Ile 2855 2860 2865 Thr Lys Val Leu Lys Gln Met Asn Leu Tyr Tyr Asp Ser Glu Phe 2870 2875 2880 His Phe Ala Leu Lys Val Val Gly Thr Asp Gly Thr Glu Leu Pro 2885 2890 2895 Ser Asp Ser Ile Lys Ser Pro Thr Ile Glu His His Gln Ala Asp 2900 2905 2910 His Gln Gly His Gly Gly Glu Gly Gly Gln His Pro Glu His Thr 2915 2920 2925 Gly Phe Val Arg Lys Asp Ile Asp Ser Leu Ser Leu Asp Glu Ala 2930 2935 2940 Asn Asp Leu Lys Asn Ala Leu Tyr Lys Leu Gln Asn Asp Gln Ser 2945 2950 2955 Leu Ser Gly Tyr Glu Ser Ile Ala Gly Phe His Gly Tyr Pro Tyr 2960 2965 2970 Leu Cys Pro Asp Asn Gly Asp Asp Lys Tyr Ala Cys Cys Val His 2975 2980 2985 Gly Met Pro Val Phe Pro His Trp His Arg Leu Leu Thr Ile Gln 2990 2995 3000 Phe Glu Arg Ala Leu Lys Glu His Gly Ser His Leu Gly Val Pro 3005 3010 3015 Tyr Trp Asp Trp Thr Lys Ser Met Thr Ala Leu Pro Ala Phe Phe 3020 3025 3030 Ala Asp Ser Ser Asn Asn Asn Pro Phe Tyr Lys Tyr His Ile Met 3035 3040 3045 Lys Ala Gly His Asp Thr Thr Arg Gln Pro Asn Asp Leu Leu Phe 3050 3055 3060 Asn Gln Pro Gln Val His Gly Tyr Asp Tyr Leu Tyr Tyr Leu Ala 3065 3070 3075 Leu Ser Thr Leu Glu Glu Asp Asn Tyr Cys Asp Phe Glu Val Gln 3080 3085 3090 Tyr Glu Asn Phe His Asn Ala Val His Ile Trp Leu Gly Gly Thr 3095 3100 3105 Glu Thr Tyr Ser Met Ala Thr Leu Glu Tyr Ser Ala Phe Asp Pro 3110 3115 3120 Val Phe Met Ile Leu His Ser Gly Leu Asp Arg Leu Trp Ile Ile 3125 3130 3135 Trp Gln Lys Leu Gln Lys Leu Arg Lys Lys Pro Tyr Asn Ala Ala 3140 3145 3150 Lys Cys Ala Gly His Met Ile Asp Glu Pro Leu His Pro Phe Asp 3155 3160 3165 Tyr Glu Ser Val Asn Gln Asp Gly Phe Thr Arg Ala Asn Ala Lys 3170 3175 3180 Pro Thr Thr Val Phe Asp Ser His Arg Phe Asp Tyr His Tyr Asp 3185 3190 3195 Asn Leu Asp Val Arg Gly Lys Ser Ile Gln Glu Ile Asn Ala Ile 3200 3205 3210 Ile His Asp Leu Arg Thr Gln Pro Arg Ile Phe Ala Gly Phe Val 3215 3220 3225 Leu Ser Gly Ile Tyr Thr Ser Ala Asn Val Lys Ile Tyr Leu Ile 3230 3235 3240 Arg Glu Gly His Asp Asp Glu Tyr Val Gly Thr Phe Ala Val Leu 3245 3250 3255 Gly Gly Ala Lys Glu Met Pro Trp Ala Tyr Glu Arg Ile Phe Lys 3260 3265 3270 Tyr Asp Ile Thr Asp Val Ala His Lys Leu Lys Met His His Asp 3275 3280 3285 Asp Thr Phe Asn Phe Arg Leu Glu Ile Gln Ser Tyr Ser Gly Glu 3290 3295 3300 Thr Leu Thr Ser His Leu Pro Glu Pro Leu Ile Ile His Arg Pro 3305 3310 3315 Ala Asn Lys Asp Tyr Asp Val Leu Ile Ile Pro Leu Gly Ser Gly 3320 3325 3330 Arg Lys Leu Pro Pro Lys Val Ile Val Lys Arg Gly Thr Glu Ile 3335 3340 3345 Glu Phe His Pro Val Asp Asp Thr Ile Asn Arg Pro Val Val Asp 3350 3355 3360 Leu Gly Ser Tyr Thr Ala Leu Tyr Asn Cys Val Val Pro Pro Phe 3365 3370 3375 Thr Tyr His Gly Tyr Glu Leu Asp His Ala Tyr Ser Leu Arg Asp 3380 3385 3390 Gly His Tyr Tyr Ile Ala Gly Pro Thr Lys Asp Leu Cys Ala Ser 3395 3400 3405 Gly Asn Val Arg Ile His Ile His Ile Glu Asp Glu 3410 3415 3420 13 15 DNA Artificial Primer 12 klh2cSns-lessdeg71 13 ttyccwcayt ggcay 15 14 15 DNA Artificial Primer 11 klh2cSns-deg71 14 ttyccncayt ggcay 15 15 15 DNA Artificial Primer 30 klh2cAnti-lessdeg223 15 ttgccaaatd ggcca 15 16 16 DNA Artificial Primer 15 klh2cSns-deg96 16 cntaytggga ytggac 16 17 17 DNA Artificial Primer 53 Klh2cAnti-deg354 17 gggatytcng cytcrtc 17 18 31 DNA Artificial 5′ RACE Primer 1 18 ccatagcgtc aaacagatgc gtgtggtgtc c 31 19 28 DNA Artificial 5′ RACE Primer 2 19 gcaaagcttg ccaaatggcc caaagtcg 28 20 23 DNA Artificial 3′ RACE Primer 1 20 gagcgccctt ccacattttg ttg 23 21 23 DNA Artificial 3′ RACE Primer 2 21 agcgcccttc cacattttgt tgc 23 22 10197 DNA Megathura crenulata 22 gagaacctgg tcagaaagag tgtggagcat ctgactcagg aagagaccct ggacctgcag 60 gctgccctgc gtgagctgca gatggactca tccagcattg gtttccagaa aatagctgct 120 gctcatggcg cgcccgcatc ttgtgtgcat aaggacactt ctatagcttg ctgtattcac 180 ggcatgccta ctttcccgca ctggcacagg gcgtacgtag ttcacatgga gcgagctctg 240 cagacgaaga ggcgtacttc tgggatgcca tactgggact ggacagaacc catcacccag 300 ttgccctctc tcgctgcgga ccctgtttac atcgattccc agggcggaaa ggctcacacc 360 aactactggt acagagggaa cattgacttt ctagacaaga aaaccaatcg tgcggtcgac 420 gaccgtcttt ttgagaaggt gaagccaggc cagcacaccc atcttatgga gagtgttttg 480 gacgcactgg agcaggatga attctgtaaa ttcgaaatcc agttcgagtt agcacataat 540 gcgattcact acctggtcgg tggaaagcat gattactcta tggcaaacct cgagtacacc 600 gcctacgatc cgatcttctt tcttcaccac tccaacgttg acagaatttt cgccatctgg 660 cagaggctcc aagagctccg aaacaaagac cccaaagcta tggattgtgc ccaagaattg 720 ttacatcaga aaatggaacc tttcagttgg gaggacaatg acataccact aacaaatgaa 780 cattccactc cagccgatct ctttgactat cgtcaacttc actacgacta tgacacgttg 840 aatcttaatg gtatgactcc agaggaactg aaaacctatt tggatgagcg gtcttcaagg 900 gcaagagcct tcgcaagctt ccgtctgaaa ggatttggag gttcagccaa tgtctttgtg 960 tacgtctgca tccccgacga caacgatcgt aatgatgacc actgtgagaa ggccggtgat 1020 ttcttcgttc tgggcggacc aagcgagatg aaatggcagt tctacaggcc atacttgttt 1080 gacttgtccg acactgttca caagatggga atgaagttgg atggacacta taccgtcaag 1140 gcagagcttt tcagtgtcaa tggtacagct cttcccgatg atttgttacc tcatccagtt 1200 gttgttcatc atccggagaa gggtttcact gatcctccgg tgaagcatca ccaaagcgcc 1260 aatctcctcg ttagaaagaa catcaatgat ctcacccgtg aagaagtgtt gaatctgaga 1320 gaggcgtttc ataaattcca agaagatcgc tccgtcgacg gttatcaagc cacagctgag 1380 tatcatggtc ttccggccag atgccctcgc cctgatgcca aggacagata tgcttgttgt 1440 gtccatggaa tgccaatctt ccctcactgg cacaggcttt ttgtcacaca agttgaagat 1500 gctttagtag gccgtggagc taccattggt atcccatact gggactggac tgaacccatg 1560 acacacattc cgggtctggc aggaaacgaa acttatgtgg attctcatgg tgcatcccac 1620 acaaatcctt ttcatagttc agtgattgca tttgaagaaa atgctcccca caccaaaaga 1680 caaatagatc aaagactctt taaacccgct acctttggac accacacaga cctgttcaac 1740 cagattttgt atgcctttga acaagaagat tactgtgact ttgaagtcca atttgagatt 1800 acccataaca cgattcacgc ttggacagga ggaagcgaac atttctcaat gtcgtcccta 1860 cattacacag ctttcgatcc tttgttttac tttcaccatt ctaacgttga tcgtctttgg 1920 gccgtttggc aagccttaca gatgagacgg cataaaccct acagggccca ctgcgccata 1980 tctctggaac atatgcatct gaaaccattc gccttttcat ctccccttaa caataacgaa 2040 aagactcatg ccaatgccat gccaaacaag atctacgact atgaaaatgt cctccattac 2100 acatacgaag atttaacatt tggaggcatc tctctggaaa acatagaaaa gatgatccac 2160 gaaaaccagc aagaagacag aatatatgcc ggttttctcc tggctggcat acgtacttca 2220 gcaaatgttg atatcttcat taaaactacc gattccgtgc aacataaggc tggaacattt 2280 gcagtgctcg gtggaagcaa ggaaatgaag tggggatttg atcgcgtttt caagtttgac 2340 atcacgcacg ttttgaaaga tctcgatctc actgctgatg gcgatttcga agttactgtt 2400 gacatcactg aagtcgatgg aactaaactt gcatccagtc ttattccaca tgcttctgtc 2460 attcgtgagc atgcacgtgt taaatttgac aaagtgccaa ggagtcgtct tattcgaaaa 2520 aatgtagacc gtttgagccc cgaggagatg aatgaacttc gtaaagccct agccttactg 2580 aaagaggaca aaagtgccgg tggatttcag cagcttggtg cattccatgg ggagccaaaa 2640 tggtgtccta gtcccgaagc atctaaaaaa tttgcctgct gtgttcacgg catgtctgtg 2700 ttccctcact ggcatcgact gttgacggtt cagagtgaaa atgctttgag acgacatggc 2760 tacgatggag ctttgccgta ctgggattgg acctctcctc ttaatcacct tcccgaactg 2820 gcagatcatg agaagtacgt cgaccctgaa gatggggtag agaagcataa cccttggttc 2880 gatggtcata tagatacagt cgacaaaaca acaacaagaa gtgttcagaa taaactcttc 2940 gaacagcctg agtttggtca ttatacaagc attgccaaac aagtactgct agcgttggaa 3000 caggacaatt tctgtgactt tgaaatccaa tatgagattg cccataacta catccatgca 3060 cttgtaggag gcgctcagcc ttatggtatg gcatcgcttc gctacactgc ttttgatcca 3120 ctattctact tgcatcactc taatacagat cgtatatggg caatatggca ggctttacag 3180 aagtacagag gaaaaccgta caacgttgct aactgtgctg ttacatcgat gagagaacct 3240 ttgcaaccat ttggcctctc tgccaatatc aacacagacc atgtaaccaa ggagcattca 3300 gtgccattca acgtttttga ttacaagacc aatttcaatt atgaatatga cactttggaa 3360 tttaacggtc tctcaatctc tcagttgaat aaaaagctcg aagcgataaa gagccaagac 3420 aggttctttg caggcttcct gttatctggt ttcaagaaat catctcttgt taaattcaat 3480 atttgcaccg atagcagcaa ctgtcacccc gctggagagt tttaccttct gggtgatgaa 3540 aacgagatgc catgggcata cgatagagtc ttcaaatatg acataaccga aaaactccac 3600 gatctaaagc tgcatgcaga agaccacttc tacattgact atgaagtatt tgaccttaaa 3660 ccagcaagcc tgggaaaaga tttgttcaag cagccttcag tcattcatga accaagaata 3720 ggtcaccatg aaggcgaagt atatcaagct gaagtaactt ctgccaaccg tattcgaaaa 3780 aacattgaaa atctgagcct tggtgaactc gaaagtctga gagctgcctt cctggaaatt 3840 gaaaacgatg gaacttacga atcaatagct aaattccatg gtagccctgg tttgtgccag 3900 ttaaatggta accccatctc ttgttgtgtc catggcatgc caactttccc tcactggcac 3960 agactgtacg tggttgtcgt tgagaatgcc ctcctgaaaa aaggatcatc tgtagctgtt 4020 ccctattggg actggacaaa acgaatcgaa catttacctc acctgatttc agacgccact 4080 tactacaatt ccaggcaaca tcactatgag acaaacccat tccatcatgg caaaatcaca 4140 cacgagaatg aaatcactac tagggatccc aaggacagcc tcttccattc agactacttt 4200 tacgagcagg tcctttacgc cttggagcag gataacttct gtgatttcga gattcagttg 4260 gagatattac acaatgcatt gcattcttta cttggtggca aaggtaaata ttccatgtca 4320 aaccttgatt acgctgcttt tgatcctgtg ttcttccttc atcacgcaac gactgacaga 4380 atctgggcaa tctggcaaga ccttcagagg ttccgaaaac ggccataccg agaagcgaat 4440 tgcgctatcc aattgatgca cacgccactc cagccgtttg ataagagcga caacaatgac 4500 gaggcaacga aaacgcatgc cactccacat gatggttttg aatatcaaaa cagctttggt 4560 tatgcttacg ataatctgga actgaatcac tactcgattc ctcagcttga tcacatgctg 4620 caagaaagaa aaaggcatga cagagtattc gctggcttcc tccttcacaa tattggaaca 4680 tctgccgatg tccatgtatt tgtatgtctc ccaactgggg aacacacgaa ggactgcagt 4740 catgaggctg gtatgttctc catcttaggc ggtcaaacgg agatgtcctt tgtatttgac 4800 agactttaca aacttgacat aactaaagcc ttgaaaaaga acggtgtgca cctgcaaggg 4860 gatttcgatc tggaaattga gattacggct gtgaatggat ctcatctaga cagtcatgtc 4920 atccactctc ccactatact gtttgaggcc ggaacagatt ctgcccacac agatgatgga 4980 cacactgaac cagtgatgat tcgcaaagat atcacacaat tggacaagcg tcaacaactg 5040 tcactggtga aagccctcga gtccatgaaa gccgaccatt catctgatgg gttccaggca 5100 atcgcttcct tccatgctct tcctcctctt tgtccatcac cagctgcttc aaagaggttt 5160 gcgtgctgcg tccatggcat ggcaacgttc ccacaatggc accgtctgta cacagtccaa 5220 ttccaagatt ctctcagaaa acatggtgca gtcgttggac ttccgtactg ggactggacc 5280 ctacctcgtt ctgaattacc agagctcctg accgtctcaa ctattcatga cccggagaca 5340 ggcagagata taccaaatcc atttattggt tctaaaatag agtttgaagg agaaaacgta 5400 catactaaaa gagatatcaa tagggatcgt ctcttccagg gatcaacaaa aacacatcat 5460 aactggttta ttgagcaagc actgcttgct cttgaacaaa ccaactactg cgacttcgag 5520 gttcagtttg aaattatgca taatggtgtt catacctggg ttggaggcaa ggagccctat 5580 ggaattggcc atctgcatta tgcttcctat gatccacttt tctacatcca tcactcccaa 5640 actgatcgta tttgggctat atggcaatcg ttgcagcgtt tcagaggact ttctggatct 5700 gaggctaact gtgctgtaaa tctcatgaaa actcctctga agcctttcag ctttggagca 5760 ccatataatc ttaatgatca cacgcatgat ttctcaaagc ctgaagatac attcgactac 5820 caaaagtttg gatacatata tgacactctg gaatttgcag ggtggtcaat tcgtggcatt 5880 gaccatattg tccgtaacag gcaggaacat tcaagggtct ttgccggatt cttgcttgaa 5940 ggatttggca cctctgccac tgtcgatttc caggtctgtc gcacagcggg agactgtgaa 6000 gatgcagggt acttcaccgt gttgggaggt gaaaaagaaa tgccttgggc ctttgatcgg 6060 ctttacaagt acgacataac agaaacctta gacaagatga accttcgaca tgacgaaatc 6120 ttccagattg aagtaaccat tacatcctac gatggaactg tactcgatag tggccttatt 6180 cccacaccgt caatcatcta tgatcctgct catcatgata ttagttcgca ccacctgtcg 6240 ctcaacaagg ttcgtcatga tctgagtaca ctgagtgagc gagatattgg aagccttaaa 6300 tatgctttga gcagcttgca ggcagatacc tcagcagatg gttttgctgc cattgcatcc 6360 ttccatggtc tgcctgccaa atgtaatgac agccacaata acgaggtggc atgctgtatc 6420 catggaatgc ctacattccc ccactggcac agactctaca ccctccaatt tgagcaagct 6480 ctaagaagac atggctctag tgtagcagta ccctactggg actggacaaa gccaatacat 6540 aatattccac atctgttcac agacaaagaa tactacgatg tctggagaaa taaagtaatg 6600 ccaaatccat ttgcccgagg gtatgtcccc tcacacgata catacacggt aagagacgtc 6660 caagaaggcc tgttccacct gacatcaacg ggtgaacact cagcgcttct gaatcaagct 6720 cttttggcgc tggaacagca cgactactgc gattttgcag tccagtttga agtcatgcac 6780 aacacaatcc attacctagt gggaggacct caagtctatt ctttgtcatc ccttcattat 6840 gcttcatatg atccgatctt cttcatacac cactcctttg tagacaaggt ttgggctgtc 6900 tggcaggctc ttcaagaaaa gagaggcctt ccatcagacc gtgctgactg cgctgttagt 6960 ctcatgactc agaacatgag gcctttccat cacgaaatta accataacca gttcaccaag 7020 aaacatgcag ttccaaatga tgttttcaag tacgaactcc tgggttacag atacgacaat 7080 ctggaaatcg gtggcatgaa tttgcatgaa attgaaaagg aaatcaaaga caaacagcac 7140 catgtgagag tgtttgcagg gttcctcctt cacggaatta gaacctcagc tgatgtccaa 7200 ttccagattt gtaaaacatc agaagattgt caccatggag gccaaatctt cgttcttggg 7260 gggactaaag agatggcctg ggcttataac cgtttattca agtacgatat tacccatgct 7320 cttcatgacg cacacatcac tccagaagac gtattccatc cctctgaacc attcttcatc 7380 aaggtgtcag tgacagccgt caacggaaca gttcttccgg cttcaatcct gcatgcacca 7440 accattatct atgaacctgg tctcgaccat cacgaagatc atcattcttc ttctatggct 7500 ggacatggtg tcagaaagga aatcaacaca cttaccactg cagaggtgga caatctcaaa 7560 gatgccatga gagccgtcat ggcagaccac ggtccaaatg gataccaggc tatagcagcg 7620 ttccatggaa acccaccaat gtgccctatg ccagatggaa agaattactc gtgttgtaca 7680 catggcatgg ctactttccc ccactggcac agactgtaca caaaacagat ggaagatgcc 7740 ttgaccgccc atggtgccag agtcggcctt ccttactggg actggacaac tgcctttaca 7800 gctttgccaa cttttgtcac agatgaagag gacaatcctt tccatcatgg tcacatagac 7860 tatttgggag tggatacaac tcggtcgccc cgagacaagt tgttcaatga tccagagcga 7920 ggatcagaat cgttcttcta caggcaggtt ctcttggctt tggagcagac agatttttgt 7980 cagtttgaag tccagttcga gattacccac aatgccgtcc attcttggac tggtggactg 8040 accccctacg gaatgtccac tctggagtac acgacctatg atcctctttt ctggcttcat 8100 catgccaata ctgaccgaat atgggctatt tggcaagctc ttcaagaata cagaggtctc 8160 ccatacgacc atgccaactg cgaaatccag gcaatgaaga ggcccctacg gccattctct 8220 gatcctatta atcacaatgc gttcactcat tctaacgcaa aaccaacaga cgttttcgag 8280 tacagtagat tcaacttcca gtatgataac ttacggtttc atggtatgac aataaaaaag 8340 ttggagcatg agcttgagaa acagaaggaa gaagacagga cattcgctgc cttcctgctc 8400 cacggcatta aaaaaagtgc tgatgtaagc tttgacgtgt gcaaccatga tggagaatgc 8460 cactttgcag gtaccttcgc tattcttgga ggcgagcacg agatgccttg gtcctttgac 8520 agacttttcc gatatgacat cacccaagtt cttaagcaaa tgcacctgga gtatgattct 8580 gatttcacgt tccatatgag aattattgac acttcgggta aacaactgcc atcagacctg 8640 atcaagatgc caacggttga acacagccca ggaggaaaac atcatgagaa acatcatgaa 8700 gatcaccatg aagacatact tgttcgtaaa aacattcact ctctgtcaca ccatgaagcc 8760 gaagaattga gagatgcatt gtacaagctt caaaacgatg aaagtcacgg tggatatgaa 8820 catattgctg gattccatgg ttatcctaac ctctgtcctg aaaaagggga tgaaaagtat 8880 ccttgctgtg ttcatggtat gtctatattc ccccactggc acaggctcca cacgatacag 8940 tttgaaagag ctctgaagaa acatggttcc catctcggta ttccttactg ggattggact 9000 cagacgatat cgtctcttcc taccttcttt gcagattctg gtaataataa tccattcttc 9060 aaataccata tcagaagcat aaatcaagac actgttcgag acgttaatga agcgatattc 9120 caacaaacga agtttggaga attttcgtcc atattttacc tggccttgca ggctctagaa 9180 gaggataatt attgcgattt tgaggtgcag tacgagatac ttcacaatga ggtgcatgcc 9240 ttgattggtg gagccgagaa gtattccatg tccacgctgg aatattcagc cttcgatcca 9300 tacttcatga tccaccacgc gtctttggac aagatctgga taatctggca ggagcttcag 9360 aaacgccggg ttaagcctgc acatgctggt tcgtgtgccg gagacattat gcacgttcca 9420 cttcatcctt tcaattatga aagcgtcaac aacgatgact ttaccagaga gaattctctc 9480 cccaacgcgg tcgttgacag tcatagattc aactataaat atgacaatct taatcttcac 9540 ggccacaata tagaggagct tgaggaggtt ctccgcagcc tgagacttaa atctcgtgtt 9600 ttcgcaggat ttgttctgtc cggaatccgc acaactgcag ttgtgaaagt gtatatcaaa 9660 agcggaaccg attccgacga cgaatatgcc ggttcgtttg tcatccttgg aggagccaaa 9720 gagatgcctt gggcatatga gaggttgtat agatttgaca ttacagaaac tgtacacaat 9780 cttaacctta cggatgatca tgttaaattc aggtttgact taaagaaata tgatcatact 9840 gaattggatg catcagtact gccagctcct ataatcgttc gacgtccaaa taatgcggtt 9900 tttgatatca ttgagattcc aatagggaaa gacgtgaatc ttccaccgaa ggtggtcgtc 9960 aagagaggca ctaaaatcat gttcatgtct gttgatgaag ccgttaccac tcccatgctg 10020 aacctgggta gttatacagc aatgttcaaa tgtaaagtgc caccattcag cttccatgct 10080 tttgagcttg ggaaaatgta ttctgtcgaa tctggtgact atttcatgac ggcatcaaca 10140 actgaattat gcaacgataa taacttaagg atacatgtcc atgtggatga cgaatag 10197 23 10197 DNA Megathura crenulata SEQ ID NO23 is the complementary sequence of the coding sequence of SEQ ID NO22 23 ctattcgtca tccacatgga catgtatcct taagttatta tcgttgcata attcagttgt 60 tgatgccgtc atgaaatagt caccagattc gacagaatac attttcccaa gctcaaaagc 120 atggaagctg aatggtggca ctttacattt gaacattgct gtataactac ccaggttcag 180 catgggagtg gtaacggctt catcaacaga catgaacatg attttagtgc ctctcttgac 240 gaccaccttc ggtggaagat tcacgtcttt ccctattgga atctcaatga tatcaaaaac 300 cgcattattt ggacgtcgaa cgattatagg agctggcagt actgatgcat ccaattcagt 360 atgatcatat ttctttaagt caaacctgaa tttaacatga tcatccgtaa ggttaagatt 420 gtgtacagtt tctgtaatgt caaatctata caacctctca tatgcccaag gcatctcttt 480 ggctcctcca aggatgacaa acgaaccggc atattcgtcg tcggaatcgg ttccgctttt 540 gatatacact ttcacaactg cagttgtgcg gattccggac agaacaaatc ctgcgaaaac 600 acgagattta agtctcaggc tgcggagaac ctcctcaagc tcctctatat tgtggccgtg 660 aagattaaga ttgtcatatt tatagttgaa tctatgactg tcaacgaccg cgttggggag 720 agaattctct ctggtaaagt catcgttgtt gacgctttca taattgaaag gatgaagtgg 780 aacgtgcata atgtctccgg cacacgaacc agcatgtgca ggcttaaccc ggcgtttctg 840 aagctcctgc cagattatcc agatcttgtc caaagacgcg tggtggatca tgaagtatgg 900 atcgaaggct gaatattcca gcgtggacat ggaatacttc tcggctccac caatcaaggc 960 atgcacctca ttgtgaagta tctcgtactg cacctcaaaa tcgcaataat tatcctcttc 1020 tagagcctgc aaggccaggt aaaatatgga cgaaaattct ccaaacttcg tttgttggaa 1080 tatcgcttca ttaacgtctc gaacagtgtc ttgatttatg cttctgatat ggtatttgaa 1140 gaatggatta ttattaccag aatctgcaaa gaaggtagga agagacgata tcgtctgagt 1200 ccaatcccag taaggaatac cgagatggga accatgtttc ttcagagctc tttcaaactg 1260 tatcgtgtgg agcctgtgcc agtgggggaa tatagacata ccatgaacac agcaaggata 1320 cttttcatcc cctttttcag gacagaggtt aggataacca tggaatccag caatatgttc 1380 atatccaccg tgactttcat cgttttgaag cttgtacaat gcatctctca attcttcggc 1440 ttcatggtgt gacagagagt gaatgttttt acgaacaagt atgtcttcat ggtgatcttc 1500 atgatgtttc tcatgatgtt ttcctcctgg gctgtgttca accgttggca tcttgatcag 1560 gtctgatggc agttgtttac ccgaagtgtc aataattctc atatggaacg tgaaatcaga 1620 atcatactcc aggtgcattt gcttaagaac ttgggtgatg tcatatcgga aaagtctgtc 1680 aaaggaccaa ggcatctcgt gctcgcctcc aagaatagcg aaggtacctg caaagtggca 1740 ttctccatca tggttgcaca cgtcaaagct tacatcagca ctttttttaa tgccgtggag 1800 caggaaggca gcgaatgtcc tgtcttcttc cttctgtttc tcaagctcat gctccaactt 1860 ttttattgtc ataccatgaa accgtaagtt atcatactgg aagttgaatc tactgtactc 1920 gaaaacgtct gttggttttg cgttagaatg agtgaacgca ttgtgattaa taggatcaga 1980 gaatggccgt aggggcctct tcattgcctg gatttcgcag ttggcatggt cgtatgggag 2040 acctctgtat tcttgaagag cttgccaaat agcccatatt cggtcagtat tggcatgatg 2100 aagccagaaa agaggatcat aggtcgtgta ctccagagtg gacattccgt agggggtcag 2160 tccaccagtc caagaatgga cggcattgtg ggtaatctcg aactggactt caaactgaca 2220 aaaatctgtc tgctccaaag ccaagagaac ctgcctgtag aagaacgatt ctgatcctcg 2280 ctctggatca ttgaacaact tgtctcgggg cgaccgagtt gtatccactc ccaaatagtc 2340 tatgtgacca tgatggaaag gattgtcctc ttcatctgtg acaaaagttg gcaaagctgt 2400 aaaggcagtt gtccagtccc agtaaggaag gccgactctg gcaccatggg cggtcaaggc 2460 atcttccatc tgttttgtgt acagtctgtg ccagtggggg aaagtagcca tgccatgtgt 2520 acaacacgag taattctttc catctggcat agggcacatt ggtgggtttc catggaacgc 2580 tgctatagcc tggtatccat ttggaccgtg gtctgccatg acggctctca tggcatcttt 2640 gagattgtcc acctctgcag tggtaagtgt gttgatttcc tttctgacac catgtccagc 2700 catagaagaa gaatgatgat cttcgtgatg gtcgagacca ggttcataga taatggttgg 2760 tgcatgcagg attgaagccg gaagaactgt tccgttgacg gctgtcactg acaccttgat 2820 gaagaatggt tcagagggat ggaatacgtc ttctggagtg atgtgtgcgt catgaagagc 2880 atgggtaata tcgtacttga ataaacggtt ataagcccag gccatctctt tagtcccccc 2940 aagaacgaag atttggcctc catggtgaca atcttctgat gttttacaaa tctggaattg 3000 gacatcagct gaggttctaa ttccgtgaag gaggaaccct gcaaacactc tcacatggtg 3060 ctgtttgtct ttgatttcct tttcaatttc atgcaaattc atgccaccga tttccagatt 3120 gtcgtatctg taacccagga gttcgtactt gaaaacatca tttggaactg catgtttctt 3180 ggtgaactgg ttatggttaa tttcgtgatg gaaaggcctc atgttctgag tcatgagact 3240 aacagcgcag tcagcacggt ctgatggaag gcctctcttt tcttgaagag cctgccagac 3300 agcccaaacc ttgtctacaa aggagtggtg tatgaagaag atcggatcat atgaagcata 3360 atgaagggat gacaaagaat agacttgagg tcctcccact aggtaatgga ttgtgttgtg 3420 catgacttca aactggactg caaaatcgca gtagtcgtgc tgttccagcg ccaaaagagc 3480 ttgattcaga agcgctgagt gttcacccgt tgatgtcagg tggaacaggc cttcttggac 3540 gtctcttacc gtgtatgtat cgtgtgaggg gacataccct cgggcaaatg gatttggcat 3600 tactttattt ctccagacat cgtagtattc tttgtctgtg aacagatgtg gaatattatg 3660 tattggcttt gtccagtccc agtagggtac tgctacacta gagccatgtc ttcttagagc 3720 ttgctcaaat tggagggtgt agagtctgtg ccagtggggg aatgtaggca ttccatggat 3780 acagcatgcc acctcgttat tgtggctgtc attacatttg gcaggcagac catggaagga 3840 tgcaatggca gcaaaaccat ctgctgaggt atctgcctgc aagctgctca aagcatattt 3900 aaggcttcca atatctcgct cactcagtgt actcagatca tgacgaacct tgttgagcga 3960 caggtggtgc gaactaatat catgatgagc aggatcatag atgattgacg gtgtgggaat 4020 aaggccacta tcgagtacag ttccatcgta ggatgtaatg gttacttcaa tctggaagat 4080 ttcgtcatgt cgaaggttca tcttgtctaa ggtttctgtt atgtcgtact tgtaaagccg 4140 atcaaaggcc caaggcattt ctttttcacc tcccaacacg gtgaagtacc ctgcatcttc 4200 acagtctccc gctgtgcgac agacctggaa atcgacagtg gcagaggtgc caaatccttc 4260 aagcaagaat ccggcaaaga cccttgaatg ttcctgcctg ttacggacaa tatggtcaat 4320 gccacgaatt gaccaccctg caaattccag agtgtcatat atgtatccaa acttttggta 4380 gtcgaatgta tcttcaggct ttgagaaatc atgcgtgtga tcattaagat tatatggtgc 4440 tccaaagctg aaaggcttca gaggagtttt catgagattt acagcacagt tagcctcaga 4500 tccagaaagt cctctgaaac gctgcaacga ttgccatata gcccaaatac gatcagtttg 4560 ggagtgatgg atgtagaaaa gtggatcata ggaagcataa tgcagatggc caattccata 4620 gggctccttg cctccaaccc aggtatgaac accattatgc ataatttcaa actgaacctc 4680 gaagtcgcag tagttggttt gttcaagagc aagcagtgct tgctcaataa accagttatg 4740 atgtgttttt gttgatccct ggaagagacg atccctattg atatctcttt tagtatgtac 4800 gttttctcct tcaaactcta ttttagaacc aataaatgga tttggtatat ctctgcctgt 4860 ctccgggtca tgaatagttg agacggtcag gagctctggt aattcagaac gaggtagggt 4920 ccagtcccag tacggaagtc caacgactgc accatgtttt ctgagagaat cttggaattg 4980 gactgtgtac agacggtgcc attgtgggaa cgttgccatg ccatggacgc agcacgcaaa 5040 cctctttgaa gcagctggtg atggacaaag aggaggaaga gcatggaagg aagcgattgc 5100 ctggaaccca tcagatgaat ggtcggcttt catggactcg agggctttca ccagtgacag 5160 ttgttgacgc ttgtccaatt gtgtgatatc tttgcgaatc atcactggtt cagtgtgtcc 5220 atcatctgtg tgggcagaat ctgttccggc ctcaaacagt atagtgggag agtggatgac 5280 atgactgtct agatgagatc cattcacagc cgtaatctca atttccagat cgaaatcccc 5340 ttgcaggtgc acaccgttct ttttcaaggc tttagttatg tcaagtttgt aaagtctgtc 5400 aaatacaaag gacatctccg tttgaccgcc taagatggag aacataccag cctcatgact 5460 gcagtccttc gtgtgttccc cagttgggag acatacaaat acatggacat cggcagatgt 5520 tccaatattg tgaaggagga agccagcgaa tactctgtca tgcctttttc tttcttgcag 5580 catgtgatca agctgaggaa tcgagtagtg attcagttcc agattatcgt aagcataacc 5640 aaagctgttt tgatattcaa aaccatcatg tggagtggca tgcgttttcg ttgcctcgtc 5700 attgttgtcg ctcttatcaa acggctggag tggcgtgtgc atcaattgga tagcgcaatt 5760 cgcttctcgg tatggccgtt ttcggaacct ctgaaggtct tgccagattg cccagattct 5820 gtcagtcgtt gcgtgatgaa ggaagaacac aggatcaaaa gcagcgtaat caaggtttga 5880 catggaatat ttacctttgc caccaagtaa agaatgcaat gcattgtgta atatctccaa 5940 ctgaatctcg aaatcacaga agttatcctg ctccaaggcg taaaggacct gctcgtaaaa 6000 gtagtctgaa tggaagaggc tgtccttggg atccctagta gtgatttcat tctcgtgtgt 6060 gattttgcca tgatggaatg ggtttgtctc atagtgatgt tgcctggaat tgtagtaagt 6120 ggcgtctgaa atcaggtgag gtaaatgttc gattcgtttt gtccagtccc aatagggaac 6180 agctacagat gatccttttt tcaggagggc attctcaacg acaaccacgt acagtctgtg 6240 ccagtgaggg aaagttggca tgccatggac acaacaagag atggggttac catttaactg 6300 gcacaaacca gggctaccat ggaatttagc tattgattcg taagttccat cgttttcaat 6360 ttccaggaag gcagctctca gactttcgag ttcaccaagg ctcagatttt caatgttttt 6420 tcgaatacgg ttggcagaag ttacttcagc ttgatatact tcgccttcat ggtgacctat 6480 tcttggttca tgaatgactg aaggctgctt gaacaaatct tttcccaggc ttgctggttt 6540 aaggtcaaat acttcatagt caatgtagaa gtggtcttct gcatgcagct ttagatcgtg 6600 gagtttttcg gttatgtcat atttgaagac tctatcgtat gcccatggca tctcgttttc 6660 atcacccaga aggtaaaact ctccagcggg gtgacagttg ctgctatcgg tgcaaatatt 6720 gaatttaaca agagatgatt tcttgaaacc agataacagg aagcctgcaa agaacctgtc 6780 ttggctcttt atcgcttcga gctttttatt caactgagag attgagagac cgttaaattc 6840 caaagtgtca tattcataat tgaaattggt cttgtaatca aaaacgttga atggcactga 6900 atgctccttg gttacatggt ctgtgttgat attggcagag aggccaaatg gttgcaaagg 6960 ttctctcatc gatgtaacag cacagttagc aacgttgtac ggttttcctc tgtacttctg 7020 taaagcctgc catattgccc atatacgatc tgtattagag tgatgcaagt agaatagtgg 7080 atcaaaagca gtgtagcgaa gcgatgccat accataaggc tgagcgcctc ctacaagtgc 7140 atggatgtag ttatgggcaa tctcatattg gatttcaaag tcacagaaat tgtcctgttc 7200 caacgctagc agtacttgtt tggcaatgct tgtataatga ccaaactcag gctgttcgaa 7260 gagtttattc tgaacacttc ttgttgttgt tttgtcgact gtatctatat gaccatcgaa 7320 ccaagggtta tgcttctcta ccccatcttc agggtcgacg tacttctcat gatctgccag 7380 ttcgggaagg tgattaagag gagaggtcca atcccagtac ggcaaagctc catcgtagcc 7440 atgtcgtctc aaagcatttt cactctgaac cgtcaacagt cgatgccagt gagggaacac 7500 agacatgccg tgaacacagc aggcaaattt tttagatgct tcgggactag gacaccattt 7560 tggctcccca tggaatgcac caagctgctg aaatccaccg gcacttttgt cctctttcag 7620 taaggctagg gctttacgaa gttcattcat ctcctcgggg ctcaaacggt ctacattttt 7680 tcgaataaga cgactccttg gcactttgtc aaatttaaca cgtgcatgct cacgaatgac 7740 agaagcatgt ggaataagac tggatgcaag tttagttcca tcgacttcag tgatgtcaac 7800 agtaacttcg aaatcgccat cagcagtgag atcgagatct ttcaaaacgt gcgtgatgtc 7860 aaacttgaaa acgcgatcaa atccccactt catttccttg cttccaccga gcactgcaaa 7920 tgttccagcc ttatgttgca cggaatcggt agttttaatg aagatatcaa catttgctga 7980 agtacgtatg ccagccagga gaaaaccggc atatattctg tcttcttgct ggttttcgtg 8040 gatcatcttt tctatgtttt ccagagagat gcctccaaat gttaaatctt cgtatgtgta 8100 atggaggaca ttttcatagt cgtagatctt gtttggcatg gcattggcat gagtcttttc 8160 gttattgtta aggggagatg aaaaggcgaa tggtttcaga tgcatatgtt ccagagatat 8220 ggcgcagtgg gccctgtagg gtttatgccg tctcatctgt aaggcttgcc aaacggccca 8280 aagacgatca acgttagaat ggtgaaagta aaacaaagga tcgaaagctg tgtaatgtag 8340 ggacgacatt gagaaatgtt cgcttcctcc tgtccaagcg tgaatcgtgt tatgggtaat 8400 ctcaaattgg acttcaaagt cacagtaatc ttcttgttca aaggcataca aaatctggtt 8460 gaacaggtct gtgtggtgtc caaaggtagc gggtttaaag agtctttgat ctatttgtct 8520 tttggtgtgg ggagcatttt cttcaaatgc aatcactgaa ctatgaaaag gatttgtgtg 8580 ggatgcacca tgagaatcca cataagtttc gtttcctgcc agacccggaa tgtgtgtcat 8640 gggttcagtc cagtcccagt atgggatacc aatggtagct ccacggccta ctaaagcatc 8700 ttcaacttgt gtgacaaaaa gcctgtgcca gtgagggaag attggcattc catggacaca 8760 acaagcatat ctgtccttgg catcagggcg agggcatctg gccggaagac catgatactc 8820 agctgtggct tgataaccgt cgacggagcg atcttcttgg aatttatgaa acgcctctct 8880 cagattcaac acttcttcac gggtgagatc attgatgttc tttctaacga ggagattggc 8940 gctttggtga tgcttcaccg gaggatcagt gaaacccttc tccggatgat gaacaacaac 9000 tggatgaggt aacaaatcat cgggaagagc tgtaccattg acactgaaaa gctctgcctt 9060 gacggtatag tgtccatcca acttcattcc catcttgtga acagtgtcgg acaagtcaaa 9120 caagtatggc ctgtagaact gccatttcat ctcgcttggt ccgcccagaa cgaagaaatc 9180 accggccttc tcacagtggt catcattacg atcgttgtcg tcggggatgc agacgtacac 9240 aaagacattg gctgaacctc caaatccttt cagacggaag cttgcgaagg ctcttgccct 9300 tgaagaccgc tcatccaaat aggttttcag ttcctctgga gtcataccat taagattcaa 9360 cgtgtcatag tcgtagtgaa gttgacgata gtcaaagaga tcggctggag tggaatgttc 9420 atttgttagt ggtatgtcat tgtcctccca actgaaaggt tccattttct gatgtaacaa 9480 ttcttgggca caatccatag ctttggggtc tttgtttcgg agctcttgga gcctctgcca 9540 gatggcgaaa attctgtcaa cgttggagtg gtgaagaaag aagatcggat cgtaggcggt 9600 gtactcgagg tttgccatag agtaatcatg ctttccaccg accaggtagt gaatcgcatt 9660 atgtgctaac tcgaactgga tttcgaattt acagaattca tcctgctcca gtgcgtccaa 9720 aacactctcc ataagatggg tgtgctggcc tggcttcacc ttctcaaaaa gacggtcgtc 9780 gaccgcacga ttggttttct tgtctagaaa gtcaatgttc cctctgtacc agtagttggt 9840 gtgagccttt ccgccctggg aatcgatgta aacagggtcc gcagcgagag agggcaactg 9900 ggtgatgggt tctgtccagt cccagtatgg catcccagaa gtacgcctct tcgtctgcag 9960 agctcgctcc atgtgaacta cgtacgccct gtgccagtgc gggaaagtag gcatgccgtg 10020 aatacagcaa gctatagaag tgtccttatg cacacaagat gcgggcgcgc catgagcagc 10080 agctattttc tggaaaccaa tgctggatga gtccatctgc agctcacgca gggcagcctg 10140 caggtccagg gtctcttcct gagtcagatg ctccacactc tttctgacca ggttctc 10197 24 3398 PRT Megathura crenulata SEQ ID NO24 is the deduced amino acid sequence of SEQ ID NO22 24 Glu Asn Leu Val Arg Lys Ser Val Glu His Leu Thr Gln Glu Glu Thr 1 5 10 15 Leu Asp Leu Gln Ala Ala Leu Arg Glu Leu Gln Met Asp Ser Ser Ser 20 25 30 Ile Gly Phe Gln Lys Ile Ala Ala Ala His Gly Ala Pro Ala Ser Cys 35 40 45 Val His Lys Asp Thr Ser Ile Ala Cys Cys Ile His Gly Met Pro Thr 50 55 60 Phe Pro His Trp His Arg Ala Tyr Val Val His Met Glu Arg Ala Leu 65 70 75 80 Gln Thr Lys Arg Arg Thr Ser Gly Met Pro Tyr Trp Asp Trp Thr Glu 85 90 95 Pro Ile Thr Gln Leu Pro Ser Leu Ala Ala Asp Pro Val Tyr Ile Asp 100 105 110 Ser Gln Gly Gly Lys Ala His Thr Asn Tyr Trp Tyr Arg Gly Asn Ile 115 120 125 Asp Phe Leu Asp Lys Lys Thr Asn Arg Ala Val Asp Asp Arg Leu Phe 130 135 140 Glu Lys Val Lys Pro Gly Gln His Thr His Leu Met Glu Ser Val Leu 145 150 155 160 Asp Ala Leu Glu Gln Asp Glu Phe Cys Lys Phe Glu Ile Gln Phe Glu 165 170 175 Leu Ala His Asn Ala Ile His Tyr Leu Val Gly Gly Lys His Asp Tyr 180 185 190 Ser Met Ala Asn Leu Glu Tyr Thr Ala Tyr Asp Pro Ile Phe Phe Leu 195 200 205 His His Ser Asn Val Asp Arg Ile Phe Ala Ile Trp Gln Arg Leu Gln 210 215 220 Glu Leu Arg Asn Lys Asp Pro Lys Ala Met Asp Cys Ala Gln Glu Leu 225 230 235 240 Leu His Gln Lys Met Glu Pro Phe Ser Trp Glu Asp Asn Asp Ile Pro 245 250 255 Leu Thr Asn Glu His Ser Thr Pro Ala Asp Leu Phe Asp Tyr Arg Gln 260 265 270 Leu His Tyr Asp Tyr Asp Thr Leu Asn Leu Asn Gly Met Thr Pro Glu 275 280 285 Glu Leu Lys Thr Tyr Leu Asp Glu Arg Ser Ser Arg Ala Arg Ala Phe 290 295 300 Ala Ser Phe Arg Leu Lys Gly Phe Gly Gly Ser Ala Asn Val Phe Val 305 310 315 320 Tyr Val Cys Ile Pro Asp Asp Asn Asp Arg Asn Asp Asp His Cys Glu 325 330 335 Lys Ala Gly Asp Phe Phe Val Leu Gly Gly Pro Ser Glu Met Lys Trp 340 345 350 Gln Phe Tyr Arg Pro Tyr Leu Phe Asp Leu Ser Asp Thr Val His Lys 355 360 365 Met Gly Met Lys Leu Asp Gly His Tyr Thr Val Lys Ala Glu Leu Phe 370 375 380 Ser Val Asn Gly Thr Ala Leu Pro Asp Asp Leu Leu Pro His Pro Val 385 390 395 400 Val Val His His Pro Glu Lys Gly Phe Thr Asp Pro Pro Val Lys His 405 410 415 His Gln Ser Ala Asn Leu Leu Val Arg Lys Asn Ile Asn Asp Leu Thr 420 425 430 Arg Glu Glu Val Leu Asn Leu Arg Glu Ala Phe His Lys Phe Gln Glu 435 440 445 Asp Arg Ser Val Asp Gly Tyr Gln Ala Thr Ala Glu Tyr His Gly Leu 450 455 460 Pro Ala Arg Cys Pro Arg Pro Asp Ala Lys Asp Arg Tyr Ala Cys Cys 465 470 475 480 Val His Gly Met Pro Ile Phe Pro His Trp His Arg Leu Phe Val Thr 485 490 495 Gln Val Glu Asp Ala Leu Val Gly Arg Gly Ala Thr Ile Gly Ile Pro 500 505 510 Tyr Trp Asp Trp Thr Glu Pro Met Thr His Ile Pro Gly Leu Ala Gly 515 520 525 Asn Glu Thr Tyr Val Asp Ser His Gly Ala Ser His Thr Asn Pro Phe 530 535 540 His Ser Ser Val Ile Ala Phe Glu Glu Asn Ala Pro His Thr Lys Arg 545 550 555 560 Gln Ile Asp Gln Arg Leu Phe Lys Pro Ala Thr Phe Gly His His Thr 565 570 575 Asp Leu Phe Asn Gln Ile Leu Tyr Ala Phe Glu Gln Glu Asp Tyr Cys 580 585 590 Asp Phe Glu Val Gln Phe Glu Ile Thr His Asn Thr Ile His Ala Trp 595 600 605 Thr Gly Gly Ser Glu His Phe Ser Met Ser Ser Leu His Tyr Thr Ala 610 615 620 Phe Asp Pro Leu Phe Tyr Phe His His Ser Asn Val Asp Arg Leu Trp 625 630 635 640 Ala Val Trp Gln Ala Leu Gln Met Arg Arg His Lys Pro Tyr Arg Ala 645 650 655 His Cys Ala Ile Ser Leu Glu His Met His Leu Lys Pro Phe Ala Phe 660 665 670 Ser Ser Pro Leu Asn Asn Asn Glu Lys Thr His Ala Asn Ala Met Pro 675 680 685 Asn Lys Ile Tyr Asp Tyr Glu Asn Val Leu His Tyr Thr Tyr Glu Asp 690 695 700 Leu Thr Phe Gly Gly Ile Ser Leu Glu Asn Ile Glu Lys Met Ile His 705 710 715 720 Glu Asn Gln Gln Glu Asp Arg Ile Tyr Ala Gly Phe Leu Leu Ala Gly 725 730 735 Ile Arg Thr Ser Ala Asn Val Asp Ile Phe Ile Lys Thr Thr Asp Ser 740 745 750 Val Gln His Lys Ala Gly Thr Phe Ala Val Leu Gly Gly Ser Lys Glu 755 760 765 Met Lys Trp Gly Phe Asp Arg Val Phe Lys Phe Asp Ile Thr His Val 770 775 780 Leu Lys Asp Leu Asp Leu Thr Ala Asp Gly Asp Phe Glu Val Thr Val 785 790 795 800 Asp Ile Thr Glu Val Asp Gly Thr Lys Leu Ala Ser Ser Leu Ile Pro 805 810 815 His Ala Ser Val Ile Arg Glu His Ala Arg Val Lys Phe Asp Lys Val 820 825 830 Pro Arg Ser Arg Leu Ile Arg Lys Asn Val Asp Arg Leu Ser Pro Glu 835 840 845 Glu Met Asn Glu Leu Arg Lys Ala Leu Ala Leu Leu Lys Glu Asp Lys 850 855 860 Ser Ala Gly Gly Phe Gln Gln Leu Gly Ala Phe His Gly Glu Pro Lys 865 870 875 880 Trp Cys Pro Ser Pro Glu Ala Ser Lys Lys Phe Ala Cys Cys Val His 885 890 895 Gly Met Ser Val Phe Pro His Trp His Arg Leu Leu Thr Val Gln Ser 900 905 910 Glu Asn Ala Leu Arg Arg His Gly Tyr Asp Gly Ala Leu Pro Tyr Trp 915 920 925 Asp Trp Thr Ser Pro Leu Asn His Leu Pro Glu Leu Ala Asp His Glu 930 935 940 Lys Tyr Val Asp Pro Glu Asp Gly Val Glu Lys His Asn Pro Trp Phe 945 950 955 960 Asp Gly His Ile Asp Thr Val Asp Lys Thr Thr Thr Arg Ser Val Gln 965 970 975 Asn Lys Leu Phe Glu Gln Pro Glu Phe Gly His Tyr Thr Ser Ile Ala 980 985 990 Lys Gln Val Leu Leu Ala Leu Glu Gln Asp Asn Phe Cys Asp Phe Glu 995 1000 1005 Ile Gln Tyr Glu Ile Ala His Asn Tyr Ile His Ala Leu Val Gly 1010 1015 1020 Gly Ala Gln Pro Tyr Gly Met Ala Ser Leu Arg Tyr Thr Ala Phe 1025 1030 1035 Asp Pro Leu Phe Tyr Leu His His Ser Asn Thr Asp Arg Ile Trp 1040 1045 1050 Ala Ile Trp Gln Ala Leu Gln Lys Tyr Arg Gly Lys Pro Tyr Asn 1055 1060 1065 Val Ala Asn Cys Ala Val Thr Ser Met Arg Glu Pro Leu Gln Pro 1070 1075 1080 Phe Gly Leu Ser Ala Asn Ile Asn Thr Asp His Val Thr Lys Glu 1085 1090 1095 His Ser Val Pro Phe Asn Val Phe Asp Tyr Lys Thr Asn Phe Asn 1100 1105 1110 Tyr Glu Tyr Asp Thr Leu Glu Phe Asn Gly Leu Ser Ile Ser Gln 1115 1120 1125 Leu Asn Lys Lys Leu Glu Ala Ile Lys Ser Gln Asp Arg Phe Phe 1130 1135 1140 Ala Gly Phe Leu Leu Ser Gly Phe Lys Lys Ser Ser Leu Val Lys 1145 1150 1155 Phe Asn Ile Cys Thr Asp Ser Ser Asn Cys His Pro Ala Gly Glu 1160 1165 1170 Phe Tyr Leu Leu Gly Asp Glu Asn Glu Met Pro Trp Ala Tyr Asp 1175 1180 1185 Arg Val Phe Lys Tyr Asp Ile Thr Glu Lys Leu His Asp Leu Lys 1190 1195 1200 Leu His Ala Glu Asp His Phe Tyr Ile Asp Tyr Glu Val Phe Asp 1205 1210 1215 Leu Lys Pro Ala Ser Leu Gly Lys Asp Leu Phe Lys Gln Pro Ser 1220 1225 1230 Val Ile His Glu Pro Arg Ile Gly His His Glu Gly Glu Val Tyr 1235 1240 1245 Gln Ala Glu Val Thr Ser Ala Asn Arg Ile Arg Lys Asn Ile Glu 1250 1255 1260 Asn Leu Ser Leu Gly Glu Leu Glu Ser Leu Arg Ala Ala Phe Leu 1265 1270 1275 Glu Ile Glu Asn Asp Gly Thr Tyr Glu Ser Ile Ala Lys Phe His 1280 1285 1290 Gly Ser Pro Gly Leu Cys Gln Leu Asn Gly Asn Pro Ile Ser Cys 1295 1300 1305 Cys Val His Gly Met Pro Thr Phe Pro His Trp His Arg Leu Tyr 1310 1315 1320 Val Val Val Val Glu Asn Ala Leu Leu Lys Lys Gly Ser Ser Val 1325 1330 1335 Ala Val Pro Tyr Trp Asp Trp Thr Lys Arg Ile Glu His Leu Pro 1340 1345 1350 His Leu Ile Ser Asp Ala Thr Tyr Tyr Asn Ser Arg Gln His His 1355 1360 1365 Tyr Glu Thr Asn Pro Phe His His Gly Lys Ile Thr His Glu Asn 1370 1375 1380 Glu Ile Thr Thr Arg Asp Pro Lys Asp Ser Leu Phe His Ser Asp 1385 1390 1395 Tyr Phe Tyr Glu Gln Val Leu Tyr Ala Leu Glu Gln Asp Asn Phe 1400 1405 1410 Cys Asp Phe Glu Ile Gln Leu Glu Ile Leu His Asn Ala Leu His 1415 1420 1425 Ser Leu Leu Gly Gly Lys Gly Lys Tyr Ser Met Ser Asn Leu Asp 1430 1435 1440 Tyr Ala Ala Phe Asp Pro Val Phe Phe Leu His His Ala Thr Thr 1445 1450 1455 Asp Arg Ile Trp Ala Ile Trp Gln Asp Leu Gln Arg Phe Arg Lys 1460 1465 1470 Arg Pro Tyr Arg Glu Ala Asn Cys Ala Ile Gln Leu Met His Thr 1475 1480 1485 Pro Leu Gln Pro Phe Asp Lys Ser Asp Asn Asn Asp Glu Ala Thr 1490 1495 1500 Lys Thr His Ala Thr Pro His Asp Gly Phe Glu Tyr Gln Asn Ser 1505 1510 1515 Phe Gly Tyr Ala Tyr Asp Asn Leu Glu Leu Asn His Tyr Ser Ile 1520 1525 1530 Pro Gln Leu Asp His Met Leu Gln Glu Arg Lys Arg His Asp Arg 1535 1540 1545 Val Phe Ala Gly Phe Leu Leu His Asn Ile Gly Thr Ser Ala Asp 1550 1555 1560 Val His Val Phe Val Cys Leu Pro Thr Gly Glu His Thr Lys Asp 1565 1570 1575 Cys Ser His Glu Ala Gly Met Phe Ser Ile Leu Gly Gly Gln Thr 1580 1585 1590 Glu Met Ser Phe Val Phe Asp Arg Leu Tyr Lys Leu Asp Ile Thr 1595 1600 1605 Lys Ala Leu Lys Lys Asn Gly Val His Leu Gln Gly Asp Phe Asp 1610 1615 1620 Leu Glu Ile Glu Ile Thr Ala Val Asn Gly Ser His Leu Asp Ser 1625 1630 1635 His Val Ile His Ser Pro Thr Ile Leu Phe Glu Ala Gly Thr Asp 1640 1645 1650 Ser Ala His Thr Asp Asp Gly His Thr Glu Pro Val Met Ile Arg 1655 1660 1665 Lys Asp Ile Thr Gln Leu Asp Lys Arg Gln Gln Leu Ser Leu Val 1670 1675 1680 Lys Ala Leu Glu Ser Met Lys Ala Asp His Ser Ser Asp Gly Phe 1685 1690 1695 Gln Ala Ile Ala Ser Phe His Ala Leu Pro Pro Leu Cys Pro Ser 1700 1705 1710 Pro Ala Ala Ser Lys Arg Phe Ala Cys Cys Val His Gly Met Ala 1715 1720 1725 Thr Phe Pro Gln Trp His Arg Leu Tyr Thr Val Gln Phe Gln Asp 1730 1735 1740 Ser Leu Arg Lys His Gly Ala Val Val Gly Leu Pro Tyr Trp Asp 1745 1750 1755 Trp Thr Leu Pro Arg Ser Glu Leu Pro Glu Leu Leu Thr Val Ser 1760 1765 1770 Thr Ile His Asp Pro Glu Thr Gly Arg Asp Ile Pro Asn Pro Phe 1775 1780 1785 Ile Gly Ser Lys Ile Glu Phe Glu Gly Glu Asn Val His Thr Lys 1790 1795 1800 Arg Asp Ile Asn Arg Asp Arg Leu Phe Gln Gly Ser Thr Lys Thr 1805 1810 1815 His His Asn Trp Phe Ile Glu Gln Ala Leu Leu Ala Leu Glu Gln 1820 1825 1830 Thr Asn Tyr Cys Asp Phe Glu Val Gln Phe Glu Ile Met His Asn 1835 1840 1845 Gly Val His Thr Trp Val Gly Gly Lys Glu Pro Tyr Gly Ile Gly 1850 1855 1860 His Leu His Tyr Ala Ser Tyr Asp Pro Leu Phe Tyr Ile His His 1865 1870 1875 Ser Gln Thr Asp Arg Ile Trp Ala Ile Trp Gln Ser Leu Gln Arg 1880 1885 1890 Phe Arg Gly Leu Ser Gly Ser Glu Ala Asn Cys Ala Val Asn Leu 1895 1900 1905 Met Lys Thr Pro Leu Lys Pro Phe Ser Phe Gly Ala Pro Tyr Asn 1910 1915 1920 Leu Asn Asp His Thr His Asp Phe Ser Lys Pro Glu Asp Thr Phe 1925 1930 1935 Asp Tyr Gln Lys Phe Gly Tyr Ile Tyr Asp Thr Leu Glu Phe Ala 1940 1945 1950 Gly Trp Ser Ile Arg Gly Ile Asp His Ile Val Arg Asn Arg Gln 1955 1960 1965 Glu His Ser Arg Val Phe Ala Gly Phe Leu Leu Glu Gly Phe Gly 1970 1975 1980 Thr Ser Ala Thr Val Asp Phe Gln Val Cys Arg Thr Ala Gly Asp 1985 1990 1995 Cys Glu Asp Ala Gly Tyr Phe Thr Val Leu Gly Gly Glu Lys Glu 2000 2005 2010 Met Pro Trp Ala Phe Asp Arg Leu Tyr Lys Tyr Asp Ile Thr Glu 2015 2020 2025 Thr Leu Asp Lys Met Asn Leu Arg His Asp Glu Ile Phe Gln Ile 2030 2035 2040 Glu Val Thr Ile Thr Ser Tyr Asp Gly Thr Val Leu Asp Ser Gly 2045 2050 2055 Leu Ile Pro Thr Pro Ser Ile Ile Tyr Asp Pro Ala His His Asp 2060 2065 2070 Ile Ser Ser His His Leu Ser Leu Asn Lys Val Arg His Asp Leu 2075 2080 2085 Ser Thr Leu Ser Glu Arg Asp Ile Gly Ser Leu Lys Tyr Ala Leu 2090 2095 2100 Ser Ser Leu Gln Ala Asp Thr Ser Ala Asp Gly Phe Ala Ala Ile 2105 2110 2115 Ala Ser Phe His Gly Leu Pro Ala Lys Cys Asn Asp Ser His Asn 2120 2125 2130 Asn Glu Val Ala Cys Cys Ile His Gly Met Pro Thr Phe Pro His 2135 2140 2145 Trp His Arg Leu Tyr Thr Leu Gln Phe Glu Gln Ala Leu Arg Arg 2150 2155 2160 His Gly Ser Ser Val Ala Val Pro Tyr Trp Asp Trp Thr Lys Pro 2165 2170 2175 Ile His Asn Ile Pro His Leu Phe Thr Asp Lys Glu Tyr Tyr Asp 2180 2185 2190 Val Trp Arg Asn Lys Val Met Pro Asn Pro Phe Ala Arg Gly Tyr 2195 2200 2205 Val Pro Ser His Asp Thr Tyr Thr Val Arg Asp Val Gln Glu Gly 2210 2215 2220 Leu Phe His Leu Thr Ser Thr Gly Glu His Ser Ala Leu Leu Asn 2225 2230 2235 Gln Ala Leu Leu Ala Leu Glu Gln His Asp Tyr Cys Asp Phe Ala 2240 2245 2250 Val Gln Phe Glu Val Met His Asn Thr Ile His Tyr Leu Val Gly 2255 2260 2265 Gly Pro Gln Val Tyr Ser Leu Ser Ser Leu His Tyr Ala Ser Tyr 2270 2275 2280 Asp Pro Ile Phe Phe Ile His His Ser Phe Val Asp Lys Val Trp 2285 2290 2295 Ala Val Trp Gln Ala Leu Gln Glu Lys Arg Gly Leu Pro Ser Asp 2300 2305 2310 Arg Ala Asp Cys Ala Val Ser Leu Met Thr Gln Asn Met Arg Pro 2315 2320 2325 Phe His His Glu Ile Asn His Asn Gln Phe Thr Lys Lys His Ala 2330 2335 2340 Val Pro Asn Asp Val Phe Lys Tyr Glu Leu Leu Gly Tyr Arg Tyr 2345 2350 2355 Asp Asn Leu Glu Ile Gly Gly Met Asn Leu His Glu Ile Glu Lys 2360 2365 2370 Glu Ile Lys Asp Lys Gln His His Val Arg Val Phe Ala Gly Phe 2375 2380 2385 Leu Leu His Gly Ile Arg Thr Ser Ala Asp Val Gln Phe Gln Ile 2390 2395 2400 Cys Lys Thr Ser Glu Asp Cys His His Gly Gly Gln Ile Phe Val 2405 2410 2415 Leu Gly Gly Thr Lys Glu Met Ala Trp Ala Tyr Asn Arg Leu Phe 2420 2425 2430 Lys Tyr Asp Ile Thr His Ala Leu His Asp Ala His Ile Thr Pro 2435 2440 2445 Glu Asp Val Phe His Pro Ser Glu Pro Phe Phe Ile Lys Val Ser 2450 2455 2460 Val Thr Ala Val Asn Gly Thr Val Leu Pro Ala Ser Ile Leu His 2465 2470 2475 Ala Pro Thr Ile Ile Tyr Glu Pro Gly Leu Asp His His Glu Asp 2480 2485 2490 His His Ser Ser Ser Met Ala Gly His Gly Val Arg Lys Glu Ile 2495 2500 2505 Asn Thr Leu Thr Thr Ala Glu Val Asp Asn Leu Lys Asp Ala Met 2510 2515 2520 Arg Ala Val Met Ala Asp His Gly Pro Asn Gly Tyr Gln Ala Ile 2525 2530 2535 Ala Ala Phe His Gly Asn Pro Pro Met Cys Pro Met Pro Asp Gly 2540 2545 2550 Lys Asn Tyr Ser Cys Cys Thr His Gly Met Ala Thr Phe Pro His 2555 2560 2565 Trp His Arg Leu Tyr Thr Lys Gln Met Glu Asp Ala Leu Thr Ala 2570 2575 2580 His Gly Ala Arg Val Gly Leu Pro Tyr Trp Asp Trp Thr Thr Ala 2585 2590 2595 Phe Thr Ala Leu Pro Thr Phe Val Thr Asp Glu Glu Asp Asn Pro 2600 2605 2610 Phe His His Gly His Ile Asp Tyr Leu Gly Val Asp Thr Thr Arg 2615 2620 2625 Ser Pro Arg Asp Lys Leu Phe Asn Asp Pro Glu Arg Gly Ser Glu 2630 2635 2640 Ser Phe Phe Tyr Arg Gln Val Leu Leu Ala Leu Glu Gln Thr Asp 2645 2650 2655 Phe Cys Gln Phe Glu Val Gln Phe Glu Ile Thr His Asn Ala Val 2660 2665 2670 His Ser Trp Thr Gly Gly Leu Thr Pro Tyr Gly Met Ser Thr Leu 2675 2680 2685 Glu Tyr Thr Thr Tyr Asp Pro Leu Phe Trp Leu His His Ala Asn 2690 2695 2700 Thr Asp Arg Ile Trp Ala Ile Trp Gln Ala Leu Gln Glu Tyr Arg 2705 2710 2715 Gly Leu Pro Tyr Asp His Ala Asn Cys Glu Ile Gln Ala Met Lys 2720 2725 2730 Arg Pro Leu Arg Pro Phe Ser Asp Pro Ile Asn His Asn Ala Phe 2735 2740 2745 Thr His Ser Asn Ala Lys Pro Thr Asp Val Phe Glu Tyr Ser Arg 2750 2755 2760 Phe Asn Phe Gln Tyr Asp Asn Leu Arg Phe His Gly Met Thr Ile 2765 2770 2775 Lys Lys Leu Glu His Glu Leu Glu Lys Gln Lys Glu Glu Asp Arg 2780 2785 2790 Thr Phe Ala Ala Phe Leu Leu His Gly Ile Lys Lys Ser Ala Asp 2795 2800 2805 Val Ser Phe Asp Val Cys Asn His Asp Gly Glu Cys His Phe Ala 2810 2815 2820 Gly Thr Phe Ala Ile Leu Gly Gly Glu His Glu Met Pro Trp Ser 2825 2830 2835 Phe Asp Arg Leu Phe Arg Tyr Asp Ile Thr Gln Val Leu Lys Gln 2840 2845 2850 Met His Leu Glu Tyr Asp Ser Asp Phe Thr Phe His Met Arg Ile 2855 2860 2865 Ile Asp Thr Ser Gly Lys Gln Leu Pro Ser Asp Leu Ile Lys Met 2870 2875 2880 Pro Thr Val Glu His Ser Pro Gly Gly Lys His His Glu Lys His 2885 2890 2895 His Glu Asp His His Glu Asp Ile Leu Val Arg Lys Asn Ile His 2900 2905 2910 Ser Leu Ser His His Glu Ala Glu Glu Leu Arg Asp Ala Leu Tyr 2915 2920 2925 Lys Leu Gln Asn Asp Glu Ser His Gly Gly Tyr Glu His Ile Ala 2930 2935 2940 Gly Phe His Gly Tyr Pro Asn Leu Cys Pro Glu Lys Gly Asp Glu 2945 2950 2955 Lys Tyr Pro Cys Cys Val His Gly Met Ser Ile Phe Pro His Trp 2960 2965 2970 His Arg Leu His Thr Ile Gln Phe Glu Arg Ala Leu Lys Lys His 2975 2980 2985 Gly Ser His Leu Gly Ile Pro Tyr Trp Asp Trp Thr Gln Thr Ile 2990 2995 3000 Ser Ser Leu Pro Thr Phe Phe Ala Asp Ser Gly Asn Asn Asn Pro 3005 3010 3015 Phe Phe Lys Tyr His Ile Arg Ser Ile Asn Gln Asp Thr Val Arg 3020 3025 3030 Asp Val Asn Glu Ala Ile Phe Gln Gln Thr Lys Phe Gly Glu Phe 3035 3040 3045 Ser Ser Ile Phe Tyr Leu Ala Leu Gln Ala Leu Glu Glu Asp Asn 3050 3055 3060 Tyr Cys Asp Phe Glu Val Gln Tyr Glu Ile Leu His Asn Glu Val 3065 3070 3075 His Ala Leu Ile Gly Gly Ala Glu Lys Tyr Ser Met Ser Thr Leu 3080 3085 3090 Glu Tyr Ser Ala Phe Asp Pro Tyr Phe Met Ile His His Ala Ser 3095 3100 3105 Leu Asp Lys Ile Trp Ile Ile Trp Gln Glu Leu Gln Lys Arg Arg 3110 3115 3120 Val Lys Pro Ala His Ala Gly Ser Cys Ala Gly Asp Ile Met His 3125 3130 3135 Val Pro Leu His Pro Phe Asn Tyr Glu Ser Val Asn Asn Asp Asp 3140 3145 3150 Phe Thr Arg Glu Asn Ser Leu Pro Asn Ala Val Val Asp Ser His 3155 3160 3165 Arg Phe Asn Tyr Lys Tyr Asp Asn Leu Asn Leu His Gly His Asn 3170 3175 3180 Ile Glu Glu Leu Glu Glu Val Leu Arg Ser Leu Arg Leu Lys Ser 3185 3190 3195 Arg Val Phe Ala Gly Phe Val Leu Ser Gly Ile Arg Thr Thr Ala 3200 3205 3210 Val Val Lys Val Tyr Ile Lys Ser Gly Thr Asp Ser Asp Asp Glu 3215 3220 3225 Tyr Ala Gly Ser Phe Val Ile Leu Gly Gly Ala Lys Glu Met Pro 3230 3235 3240 Trp Ala Tyr Glu Arg Leu Tyr Arg Phe Asp Ile Thr Glu Thr Val 3245 3250 3255 His Asn Leu Asn Leu Thr Asp Asp His Val Lys Phe Arg Phe Asp 3260 3265 3270 Leu Lys Lys Tyr Asp His Thr Glu Leu Asp Ala Ser Val Leu Pro 3275 3280 3285 Ala Pro Ile Ile Val Arg Arg Pro Asn Asn Ala Val Phe Asp Ile 3290 3295 3300 Ile Glu Ile Pro Ile Gly Lys Asp Val Asn Leu Pro Pro Lys Val 3305 3310 3315 Val Val Lys Arg Gly Thr Lys Ile Met Phe Met Ser Val Asp Glu 3320 3325 3330 Ala Val Thr Thr Pro Met Leu Asn Leu Gly Ser Tyr Thr Ala Met 3335 3340 3345 Phe Lys Cys Lys Val Pro Pro Phe Ser Phe His Ala Phe Glu Leu 3350 3355 3360 Gly Lys Met Tyr Ser Val Glu Ser Gly Asp Tyr Phe Met Thr Ala 3365 3370 3375 Ser Thr Thr Glu Leu Cys Asn Asp Asn Asn Leu Arg Ile His Val 3380 3385 3390 His Val Asp Asp Glu 3395 25 48 DNA Megathura crenulata 25 atgctgtcgg tcaggttgct tatagtcgtg ttggctttgg ccaatgca 48 26 48 DNA Megathura crenulata SEQ ID NO26 is the complementary sequence of the coding sequence of SEQ ID NO25 26 tgcattggcc aaagccaaca cgactataag caacctgacc gacagcat 48 27 16 PRT Megathura crenulata SEQ ID NO27 is the deduced amino acid sequence of SEQ ID NO25 27 Met Leu Ser Val Arg Leu Leu Ile Val Val Leu Ala Leu Ala Asn Ala 1 5 10 15 28 10245 DNA Megathura crenulata 28 atgctgtcgg tcaggttgct tatagtcgtg ttggctttgg ccaatgcaga gaacctggtc 60 agaaagagtg tggagcatct gactcaggaa gagaccctgg acctgcaggc tgccctgcgt 120 gagctgcaga tggactcatc cagcattggt ttccagaaaa tagctgctgc tcatggcgcg 180 cccgcatctt gtgtgcataa ggacacttct atagcttgct gtattcacgg catgcctact 240 ttcccgcact ggcacagggc gtacgtagtt cacatggagc gagctctgca gacgaagagg 300 cgtacttctg ggatgccata ctgggactgg acagaaccca tcacccagtt gccctctctc 360 gctgcggacc ctgtttacat cgattcccag ggcggaaagg ctcacaccaa ctactggtac 420 agagggaaca ttgactttct agacaagaaa accaatcgtg cggtcgacga ccgtcttttt 480 gagaaggtga agccaggcca gcacacccat cttatggaga gtgttttgga cgcactggag 540 caggatgaat tctgtaaatt cgaaatccag ttcgagttag cacataatgc gattcactac 600 ctggtcggtg gaaagcatga ttactctatg gcaaacctcg agtacaccgc ctacgatccg 660 atcttctttc ttcaccactc caacgttgac agaattttcg ccatctggca gaggctccaa 720 gagctccgaa acaaagaccc caaagctatg gattgtgccc aagaattgtt acatcagaaa 780 atggaacctt tcagttggga ggacaatgac ataccactaa caaatgaaca ttccactcca 840 gccgatctct ttgactatcg tcaacttcac tacgactatg acacgttgaa tcttaatggt 900 atgactccag aggaactgaa aacctatttg gatgagcggt cttcaagggc aagagccttc 960 gcaagcttcc gtctgaaagg atttggaggt tcagccaatg tctttgtgta cgtctgcatc 1020 cccgacgaca acgatcgtaa tgatgaccac tgtgagaagg ccggtgattt cttcgttctg 1080 ggcggaccaa gcgagatgaa atggcagttc tacaggccat acttgtttga cttgtccgac 1140 actgttcaca agatgggaat gaagttggat ggacactata ccgtcaaggc agagcttttc 1200 agtgtcaatg gtacagctct tcccgatgat ttgttacctc atccagttgt tgttcatcat 1260 ccggagaagg gtttcactga tcctccggtg aagcatcacc aaagcgccaa tctcctcgtt 1320 agaaagaaca tcaatgatct cacccgtgaa gaagtgttga atctgagaga ggcgtttcat 1380 aaattccaag aagatcgctc cgtcgacggt tatcaagcca cagctgagta tcatggtctt 1440 ccggccagat gccctcgccc tgatgccaag gacagatatg cttgttgtgt ccatggaatg 1500 ccaatcttcc ctcactggca caggcttttt gtcacacaag ttgaagatgc tttagtaggc 1560 cgtggagcta ccattggtat cccatactgg gactggactg aacccatgac acacattccg 1620 ggtctggcag gaaacgaaac ttatgtggat tctcatggtg catcccacac aaatcctttt 1680 catagttcag tgattgcatt tgaagaaaat gctccccaca ccaaaagaca aatagatcaa 1740 agactcttta aacccgctac ctttggacac cacacagacc tgttcaacca gattttgtat 1800 gcctttgaac aagaagatta ctgtgacttt gaagtccaat ttgagattac ccataacacg 1860 attcacgctt ggacaggagg aagcgaacat ttctcaatgt cgtccctaca ttacacagct 1920 ttcgatcctt tgttttactt tcaccattct aacgttgatc gtctttgggc cgtttggcaa 1980 gccttacaga tgagacggca taaaccctac agggcccact gcgccatatc tctggaacat 2040 atgcatctga aaccattcgc cttttcatct ccccttaaca ataacgaaaa gactcatgcc 2100 aatgccatgc caaacaagat ctacgactat gaaaatgtcc tccattacac atacgaagat 2160 ttaacatttg gaggcatctc tctggaaaac atagaaaaga tgatccacga aaaccagcaa 2220 gaagacagaa tatatgccgg ttttctcctg gctggcatac gtacttcagc aaatgttgat 2280 atcttcatta aaactaccga ttccgtgcaa cataaggctg gaacatttgc agtgctcggt 2340 ggaagcaagg aaatgaagtg gggatttgat cgcgttttca agtttgacat cacgcacgtt 2400 ttgaaagatc tcgatctcac tgctgatggc gatttcgaag ttactgttga catcactgaa 2460 gtcgatggaa ctaaacttgc atccagtctt attccacatg cttctgtcat tcgtgagcat 2520 gcacgtgtta aatttgacaa agtgccaagg agtcgtctta ttcgaaaaaa tgtagaccgt 2580 ttgagccccg aggagatgaa tgaacttcgt aaagccctag ccttactgaa agaggacaaa 2640 agtgccggtg gatttcagca gcttggtgca ttccatgggg agccaaaatg gtgtcctagt 2700 cccgaagcat ctaaaaaatt tgcctgctgt gttcacggca tgtctgtgtt ccctcactgg 2760 catcgactgt tgacggttca gagtgaaaat gctttgagac gacatggcta cgatggagct 2820 ttgccgtact gggattggac ctctcctctt aatcaccttc ccgaactggc agatcatgag 2880 aagtacgtcg accctgaaga tggggtagag aagcataacc cttggttcga tggtcatata 2940 gatacagtcg acaaaacaac aacaagaagt gttcagaata aactcttcga acagcctgag 3000 tttggtcatt atacaagcat tgccaaacaa gtactgctag cgttggaaca ggacaatttc 3060 tgtgactttg aaatccaata tgagattgcc cataactaca tccatgcact tgtaggaggc 3120 gctcagcctt atggtatggc atcgcttcgc tacactgctt ttgatccact attctacttg 3180 catcactcta atacagatcg tatatgggca atatggcagg ctttacagaa gtacagagga 3240 aaaccgtaca acgttgctaa ctgtgctgtt acatcgatga gagaaccttt gcaaccattt 3300 ggcctctctg ccaatatcaa cacagaccat gtaaccaagg agcattcagt gccattcaac 3360 gtttttgatt acaagaccaa tttcaattat gaatatgaca ctttggaatt taacggtctc 3420 tcaatctctc agttgaataa aaagctcgaa gcgataaaga gccaagacag gttctttgca 3480 ggcttcctgt tatctggttt caagaaatca tctcttgtta aattcaatat ttgcaccgat 3540 agcagcaact gtcaccccgc tggagagttt taccttctgg gtgatgaaaa cgagatgcca 3600 tgggcatacg atagagtctt caaatatgac ataaccgaaa aactccacga tctaaagctg 3660 catgcagaag accacttcta cattgactat gaagtatttg accttaaacc agcaagcctg 3720 ggaaaagatt tgttcaagca gccttcagtc attcatgaac caagaatagg tcaccatgaa 3780 ggcgaagtat atcaagctga agtaacttct gccaaccgta ttcgaaaaaa cattgaaaat 3840 ctgagccttg gtgaactcga aagtctgaga gctgccttcc tggaaattga aaacgatgga 3900 acttacgaat caatagctaa attccatggt agccctggtt tgtgccagtt aaatggtaac 3960 cccatctctt gttgtgtcca tggcatgcca actttccctc actggcacag actgtacgtg 4020 gttgtcgttg agaatgccct cctgaaaaaa ggatcatctg tagctgttcc ctattgggac 4080 tggacaaaac gaatcgaaca tttacctcac ctgatttcag acgccactta ctacaattcc 4140 aggcaacatc actatgagac aaacccattc catcatggca aaatcacaca cgagaatgaa 4200 atcactacta gggatcccaa ggacagcctc ttccattcag actactttta cgagcaggtc 4260 ctttacgcct tggagcagga taacttctgt gatttcgaga ttcagttgga gatattacac 4320 aatgcattgc attctttact tggtggcaaa ggtaaatatt ccatgtcaaa ccttgattac 4380 gctgcttttg atcctgtgtt cttccttcat cacgcaacga ctgacagaat ctgggcaatc 4440 tggcaagacc ttcagaggtt ccgaaaacgg ccataccgag aagcgaattg cgctatccaa 4500 ttgatgcaca cgccactcca gccgtttgat aagagcgaca acaatgacga ggcaacgaaa 4560 acgcatgcca ctccacatga tggttttgaa tatcaaaaca gctttggtta tgcttacgat 4620 aatctggaac tgaatcacta ctcgattcct cagcttgatc acatgctgca agaaagaaaa 4680 aggcatgaca gagtattcgc tggcttcctc cttcacaata ttggaacatc tgccgatgtc 4740 catgtatttg tatgtctccc aactggggaa cacacgaagg actgcagtca tgaggctggt 4800 atgttctcca tcttaggcgg tcaaacggag atgtcctttg tatttgacag actttacaaa 4860 cttgacataa ctaaagcctt gaaaaagaac ggtgtgcacc tgcaagggga tttcgatctg 4920 gaaattgaga ttacggctgt gaatggatct catctagaca gtcatgtcat ccactctccc 4980 actatactgt ttgaggccgg aacagattct gcccacacag atgatggaca cactgaacca 5040 gtgatgattc gcaaagatat cacacaattg gacaagcgtc aacaactgtc actggtgaaa 5100 gccctcgagt ccatgaaagc cgaccattca tctgatgggt tccaggcaat cgcttccttc 5160 catgctcttc ctcctctttg tccatcacca gctgcttcaa agaggtttgc gtgctgcgtc 5220 catggcatgg caacgttccc acaatggcac cgtctgtaca cagtccaatt ccaagattct 5280 ctcagaaaac atggtgcagt cgttggactt ccgtactggg actggaccct acctcgttct 5340 gaattaccag agctcctgac cgtctcaact attcatgacc cggagacagg cagagatata 5400 ccaaatccat ttattggttc taaaatagag tttgaaggag aaaacgtaca tactaaaaga 5460 gatatcaata gggatcgtct cttccaggga tcaacaaaaa cacatcataa ctggtttatt 5520 gagcaagcac tgcttgctct tgaacaaacc aactactgcg acttcgaggt tcagtttgaa 5580 attatgcata atggtgttca tacctgggtt ggaggcaagg agccctatgg aattggccat 5640 ctgcattatg cttcctatga tccacttttc tacatccatc actcccaaac tgatcgtatt 5700 tgggctatat ggcaatcgtt gcagcgtttc agaggacttt ctggatctga ggctaactgt 5760 gctgtaaatc tcatgaaaac tcctctgaag cctttcagct ttggagcacc atataatctt 5820 aatgatcaca cgcatgattt ctcaaagcct gaagatacat tcgactacca aaagtttgga 5880 tacatatatg acactctgga atttgcaggg tggtcaattc gtggcattga ccatattgtc 5940 cgtaacaggc aggaacattc aagggtcttt gccggattct tgcttgaagg atttggcacc 6000 tctgccactg tcgatttcca ggtctgtcgc acagcgggag actgtgaaga tgcagggtac 6060 ttcaccgtgt tgggaggtga aaaagaaatg ccttgggcct ttgatcggct ttacaagtac 6120 gacataacag aaaccttaga caagatgaac cttcgacatg acgaaatctt ccagattgaa 6180 gtaaccatta catcctacga tggaactgta ctcgatagtg gccttattcc cacaccgtca 6240 atcatctatg atcctgctca tcatgatatt agttcgcacc acctgtcgct caacaaggtt 6300 cgtcatgatc tgagtacact gagtgagcga gatattggaa gccttaaata tgctttgagc 6360 agcttgcagg cagatacctc agcagatggt tttgctgcca ttgcatcctt ccatggtctg 6420 cctgccaaat gtaatgacag ccacaataac gaggtggcat gctgtatcca tggaatgcct 6480 acattccccc actggcacag actctacacc ctccaatttg agcaagctct aagaagacat 6540 ggctctagtg tagcagtacc ctactgggac tggacaaagc caatacataa tattccacat 6600 ctgttcacag acaaagaata ctacgatgtc tggagaaata aagtaatgcc aaatccattt 6660 gcccgagggt atgtcccctc acacgataca tacacggtaa gagacgtcca agaaggcctg 6720 ttccacctga catcaacggg tgaacactca gcgcttctga atcaagctct tttggcgctg 6780 gaacagcacg actactgcga ttttgcagtc cagtttgaag tcatgcacaa cacaatccat 6840 tacctagtgg gaggacctca agtctattct ttgtcatccc ttcattatgc ttcatatgat 6900 ccgatcttct tcatacacca ctcctttgta gacaaggttt gggctgtctg gcaggctctt 6960 caagaaaaga gaggccttcc atcagaccgt gctgactgcg ctgttagtct catgactcag 7020 aacatgaggc ctttccatca cgaaattaac cataaccagt tcaccaagaa acatgcagtt 7080 ccaaatgatg ttttcaagta cgaactcctg ggttacagat acgacaatct ggaaatcggt 7140 ggcatgaatt tgcatgaaat tgaaaaggaa atcaaagaca aacagcacca tgtgagagtg 7200 tttgcagggt tcctccttca cggaattaga acctcagctg atgtccaatt ccagatttgt 7260 aaaacatcag aagattgtca ccatggaggc caaatcttcg ttcttggggg gactaaagag 7320 atggcctggg cttataaccg tttattcaag tacgatatta cccatgctct tcatgacgca 7380 cacatcactc cagaagacgt attccatccc tctgaaccat tcttcatcaa ggtgtcagtg 7440 acagccgtca acggaacagt tcttccggct tcaatcctgc atgcaccaac cattatctat 7500 gaacctggtc tcgaccatca cgaagatcat cattcttctt ctatggctgg acatggtgtc 7560 agaaaggaaa tcaacacact taccactgca gaggtggaca atctcaaaga tgccatgaga 7620 gccgtcatgg cagaccacgg tccaaatgga taccaggcta tagcagcgtt ccatggaaac 7680 ccaccaatgt gccctatgcc agatggaaag aattactcgt gttgtacaca tggcatggct 7740 actttccccc actggcacag actgtacaca aaacagatgg aagatgcctt gaccgcccat 7800 ggtgccagag tcggccttcc ttactgggac tggacaactg cctttacagc tttgccaact 7860 tttgtcacag atgaagagga caatcctttc catcatggtc acatagacta tttgggagtg 7920 gatacaactc ggtcgccccg agacaagttg ttcaatgatc cagagcgagg atcagaatcg 7980 ttcttctaca ggcaggttct cttggctttg gagcagacag atttttgtca gtttgaagtc 8040 cagttcgaga ttacccacaa tgccgtccat tcttggactg gtggactgac cccctacgga 8100 atgtccactc tggagtacac gacctatgat cctcttttct ggcttcatca tgccaatact 8160 gaccgaatat gggctatttg gcaagctctt caagaataca gaggtctccc atacgaccat 8220 gccaactgcg aaatccaggc aatgaagagg cccctacggc cattctctga tcctattaat 8280 cacaatgcgt tcactcattc taacgcaaaa ccaacagacg ttttcgagta cagtagattc 8340 aacttccagt atgataactt acggtttcat ggtatgacaa taaaaaagtt ggagcatgag 8400 cttgagaaac agaaggaaga agacaggaca ttcgctgcct tcctgctcca cggcattaaa 8460 aaaagtgctg atgtaagctt tgacgtgtgc aaccatgatg gagaatgcca ctttgcaggt 8520 accttcgcta ttcttggagg cgagcacgag atgccttggt cctttgacag acttttccga 8580 tatgacatca cccaagttct taagcaaatg cacctggagt atgattctga tttcacgttc 8640 catatgagaa ttattgacac ttcgggtaaa caactgccat cagacctgat caagatgcca 8700 acggttgaac acagcccagg aggaaaacat catgagaaac atcatgaaga tcaccatgaa 8760 gacatacttg ttcgtaaaaa cattcactct ctgtcacacc atgaagccga agaattgaga 8820 gatgcattgt acaagcttca aaacgatgaa agtcacggtg gatatgaaca tattgctgga 8880 ttccatggtt atcctaacct ctgtcctgaa aaaggggatg aaaagtatcc ttgctgtgtt 8940 catggtatgt ctatattccc ccactggcac aggctccaca cgatacagtt tgaaagagct 9000 ctgaagaaac atggttccca tctcggtatt ccttactggg attggactca gacgatatcg 9060 tctcttccta ccttctttgc agattctggt aataataatc cattcttcaa ataccatatc 9120 agaagcataa atcaagacac tgttcgagac gttaatgaag cgatattcca acaaacgaag 9180 tttggagaat tttcgtccat attttacctg gccttgcagg ctctagaaga ggataattat 9240 tgcgattttg aggtgcagta cgagatactt cacaatgagg tgcatgcctt gattggtgga 9300 gccgagaagt attccatgtc cacgctggaa tattcagcct tcgatccata cttcatgatc 9360 caccacgcgt ctttggacaa gatctggata atctggcagg agcttcagaa acgccgggtt 9420 aagcctgcac atgctggttc gtgtgccgga gacattatgc acgttccact tcatcctttc 9480 aattatgaaa gcgtcaacaa cgatgacttt accagagaga attctctccc caacgcggtc 9540 gttgacagtc atagattcaa ctataaatat gacaatctta atcttcacgg ccacaatata 9600 gaggagcttg aggaggttct ccgcagcctg agacttaaat ctcgtgtttt cgcaggattt 9660 gttctgtccg gaatccgcac aactgcagtt gtgaaagtgt atatcaaaag cggaaccgat 9720 tccgacgacg aatatgccgg ttcgtttgtc atccttggag gagccaaaga gatgccttgg 9780 gcatatgaga ggttgtatag atttgacatt acagaaactg tacacaatct taaccttacg 9840 gatgatcatg ttaaattcag gtttgactta aagaaatatg atcatactga attggatgca 9900 tcagtactgc cagctcctat aatcgttcga cgtccaaata atgcggtttt tgatatcatt 9960 gagattccaa tagggaaaga cgtgaatctt ccaccgaagg tggtcgtcaa gagaggcact 10020 aaaatcatgt tcatgtctgt tgatgaagcc gttaccactc ccatgctgaa cctgggtagt 10080 tatacagcaa tgttcaaatg taaagtgcca ccattcagct tccatgcttt tgagcttggg 10140 aaaatgtatt ctgtcgaatc tggtgactat ttcatgacgg catcaacaac tgaattatgc 10200 aacgataata acttaaggat acatgtccat gtggatgacg aatag 10245 29 10245 DNA Megathura crenulata SEQ ID NO29 is the complementary sequence of the coding sequence of SEQ ID NO28 29 ctattcgtca tccacatgga catgtatcct taagttatta tcgttgcata attcagttgt 60 tgatgccgtc atgaaatagt caccagattc gacagaatac attttcccaa gctcaaaagc 120 atggaagctg aatggtggca ctttacattt gaacattgct gtataactac ccaggttcag 180 catgggagtg gtaacggctt catcaacaga catgaacatg attttagtgc ctctcttgac 240 gaccaccttc ggtggaagat tcacgtcttt ccctattgga atctcaatga tatcaaaaac 300 cgcattattt ggacgtcgaa cgattatagg agctggcagt actgatgcat ccaattcagt 360 atgatcatat ttctttaagt caaacctgaa tttaacatga tcatccgtaa ggttaagatt 420 gtgtacagtt tctgtaatgt caaatctata caacctctca tatgcccaag gcatctcttt 480 ggctcctcca aggatgacaa acgaaccggc atattcgtcg tcggaatcgg ttccgctttt 540 gatatacact ttcacaactg cagttgtgcg gattccggac agaacaaatc ctgcgaaaac 600 acgagattta agtctcaggc tgcggagaac ctcctcaagc tcctctatat tgtggccgtg 660 aagattaaga ttgtcatatt tatagttgaa tctatgactg tcaacgaccg cgttggggag 720 agaattctct ctggtaaagt catcgttgtt gacgctttca taattgaaag gatgaagtgg 780 aacgtgcata atgtctccgg cacacgaacc agcatgtgca ggcttaaccc ggcgtttctg 840 aagctcctgc cagattatcc agatcttgtc caaagacgcg tggtggatca tgaagtatgg 900 atcgaaggct gaatattcca gcgtggacat ggaatacttc tcggctccac caatcaaggc 960 atgcacctca ttgtgaagta tctcgtactg cacctcaaaa tcgcaataat tatcctcttc 1020 tagagcctgc aaggccaggt aaaatatgga cgaaaattct ccaaacttcg tttgttggaa 1080 tatcgcttca ttaacgtctc gaacagtgtc ttgatttatg cttctgatat ggtatttgaa 1140 gaatggatta ttattaccag aatctgcaaa gaaggtagga agagacgata tcgtctgagt 1200 ccaatcccag taaggaatac cgagatggga accatgtttc ttcagagctc tttcaaactg 1260 tatcgtgtgg agcctgtgcc agtgggggaa tatagacata ccatgaacac agcaaggata 1320 cttttcatcc cctttttcag gacagaggtt aggataacca tggaatccag caatatgttc 1380 atatccaccg tgactttcat cgttttgaag cttgtacaat gcatctctca attcttcggc 1440 ttcatggtgt gacagagagt gaatgttttt acgaacaagt atgtcttcat ggtgatcttc 1500 atgatgtttc tcatgatgtt ttcctcctgg gctgtgttca accgttggca tcttgatcag 1560 gtctgatggc agttgtttac ccgaagtgtc aataattctc atatggaacg tgaaatcaga 1620 atcatactcc aggtgcattt gcttaagaac ttgggtgatg tcatatcgga aaagtctgtc 1680 aaaggaccaa ggcatctcgt gctcgcctcc aagaatagcg aaggtacctg caaagtggca 1740 ttctccatca tggttgcaca cgtcaaagct tacatcagca ctttttttaa tgccgtggag 1800 caggaaggca gcgaatgtcc tgtcttcttc cttctgtttc tcaagctcat gctccaactt 1860 ttttattgtc ataccatgaa accgtaagtt atcatactgg aagttgaatc tactgtactc 1920 gaaaacgtct gttggttttg cgttagaatg agtgaacgca ttgtgattaa taggatcaga 1980 gaatggccgt aggggcctct tcattgcctg gatttcgcag ttggcatggt cgtatgggag 2040 acctctgtat tcttgaagag cttgccaaat agcccatatt cggtcagtat tggcatgatg 2100 aagccagaaa agaggatcat aggtcgtgta ctccagagtg gacattccgt agggggtcag 2160 tccaccagtc caagaatgga cggcattgtg ggtaatctcg aactggactt caaactgaca 2220 aaaatctgtc tgctccaaag ccaagagaac ctgcctgtag aagaacgatt ctgatcctcg 2280 ctctggatca ttgaacaact tgtctcgggg cgaccgagtt gtatccactc ccaaatagtc 2340 tatgtgacca tgatggaaag gattgtcctc ttcatctgtg acaaaagttg gcaaagctgt 2400 aaaggcagtt gtccagtccc agtaaggaag gccgactctg gcaccatggg cggtcaaggc 2460 atcttccatc tgttttgtgt acagtctgtg ccagtggggg aaagtagcca tgccatgtgt 2520 acaacacgag taattctttc catctggcat agggcacatt ggtgggtttc catggaacgc 2580 tgctatagcc tggtatccat ttggaccgtg gtctgccatg acggctctca tggcatcttt 2640 gagattgtcc acctctgcag tggtaagtgt gttgatttcc tttctgacac catgtccagc 2700 catagaagaa gaatgatgat cttcgtgatg gtcgagacca ggttcataga taatggttgg 2760 tgcatgcagg attgaagccg gaagaactgt tccgttgacg gctgtcactg acaccttgat 2820 gaagaatggt tcagagggat ggaatacgtc ttctggagtg atgtgtgcgt catgaagagc 2880 atgggtaata tcgtacttga ataaacggtt ataagcccag gccatctctt tagtcccccc 2940 aagaacgaag atttggcctc catggtgaca atcttctgat gttttacaaa tctggaattg 3000 gacatcagct gaggttctaa ttccgtgaag gaggaaccct gcaaacactc tcacatggtg 3060 ctgtttgtct ttgatttcct tttcaatttc atgcaaattc atgccaccga tttccagatt 3120 gtcgtatctg taacccagga gttcgtactt gaaaacatca tttggaactg catgtttctt 3180 ggtgaactgg ttatggttaa tttcgtgatg gaaaggcctc atgttctgag tcatgagact 3240 aacagcgcag tcagcacggt ctgatggaag gcctctcttt tcttgaagag cctgccagac 3300 agcccaaacc ttgtctacaa aggagtggtg tatgaagaag atcggatcat atgaagcata 3360 atgaagggat gacaaagaat agacttgagg tcctcccact aggtaatgga ttgtgttgtg 3420 catgacttca aactggactg caaaatcgca gtagtcgtgc tgttccagcg ccaaaagagc 3480 ttgattcaga agcgctgagt gttcacccgt tgatgtcagg tggaacaggc cttcttggac 3540 gtctcttacc gtgtatgtat cgtgtgaggg gacataccct cgggcaaatg gatttggcat 3600 tactttattt ctccagacat cgtagtattc tttgtctgtg aacagatgtg gaatattatg 3660 tattggcttt gtccagtccc agtagggtac tgctacacta gagccatgtc ttcttagagc 3720 ttgctcaaat tggagggtgt agagtctgtg ccagtggggg aatgtaggca ttccatggat 3780 acagcatgcc acctcgttat tgtggctgtc attacatttg gcaggcagac catggaagga 3840 tgcaatggca gcaaaaccat ctgctgaggt atctgcctgc aagctgctca aagcatattt 3900 aaggcttcca atatctcgct cactcagtgt actcagatca tgacgaacct tgttgagcga 3960 caggtggtgc gaactaatat catgatgagc aggatcatag atgattgacg gtgtgggaat 4020 aaggccacta tcgagtacag ttccatcgta ggatgtaatg gttacttcaa tctggaagat 4080 ttcgtcatgt cgaaggttca tcttgtctaa ggtttctgtt atgtcgtact tgtaaagccg 4140 atcaaaggcc caaggcattt ctttttcacc tcccaacacg gtgaagtacc ctgcatcttc 4200 acagtctccc gctgtgcgac agacctggaa atcgacagtg gcagaggtgc caaatccttc 4260 aagcaagaat ccggcaaaga cccttgaatg ttcctgcctg ttacggacaa tatggtcaat 4320 gccacgaatt gaccaccctg caaattccag agtgtcatat atgtatccaa acttttggta 4380 gtcgaatgta tcttcaggct ttgagaaatc atgcgtgtga tcattaagat tatatggtgc 4440 tccaaagctg aaaggcttca gaggagtttt catgagattt acagcacagt tagcctcaga 4500 tccagaaagt cctctgaaac gctgcaacga ttgccatata gcccaaatac gatcagtttg 4560 ggagtgatgg atgtagaaaa gtggatcata ggaagcataa tgcagatggc caattccata 4620 gggctccttg cctccaaccc aggtatgaac accattatgc ataatttcaa actgaacctc 4680 gaagtcgcag tagttggttt gttcaagagc aagcagtgct tgctcaataa accagttatg 4740 atgtgttttt gttgatccct ggaagagacg atccctattg atatctcttt tagtatgtac 4800 gttttctcct tcaaactcta ttttagaacc aataaatgga tttggtatat ctctgcctgt 4860 ctccgggtca tgaatagttg agacggtcag gagctctggt aattcagaac gaggtagggt 4920 ccagtcccag tacggaagtc caacgactgc accatgtttt ctgagagaat cttggaattg 4980 gactgtgtac agacggtgcc attgtgggaa cgttgccatg ccatggacgc agcacgcaaa 5040 cctctttgaa gcagctggtg atggacaaag aggaggaaga gcatggaagg aagcgattgc 5100 ctggaaccca tcagatgaat ggtcggcttt catggactcg agggctttca ccagtgacag 5160 ttgttgacgc ttgtccaatt gtgtgatatc tttgcgaatc atcactggtt cagtgtgtcc 5220 atcatctgtg tgggcagaat ctgttccggc ctcaaacagt atagtgggag agtggatgac 5280 atgactgtct agatgagatc cattcacagc cgtaatctca atttccagat cgaaatcccc 5340 ttgcaggtgc acaccgttct ttttcaaggc tttagttatg tcaagtttgt aaagtctgtc 5400 aaatacaaag gacatctccg tttgaccgcc taagatggag aacataccag cctcatgact 5460 gcagtccttc gtgtgttccc cagttgggag acatacaaat acatggacat cggcagatgt 5520 tccaatattg tgaaggagga agccagcgaa tactctgtca tgcctttttc tttcttgcag 5580 catgtgatca agctgaggaa tcgagtagtg attcagttcc agattatcgt aagcataacc 5640 aaagctgttt tgatattcaa aaccatcatg tggagtggca tgcgttttcg ttgcctcgtc 5700 attgttgtcg ctcttatcaa acggctggag tggcgtgtgc atcaattgga tagcgcaatt 5760 cgcttctcgg tatggccgtt ttcggaacct ctgaaggtct tgccagattg cccagattct 5820 gtcagtcgtt gcgtgatgaa ggaagaacac aggatcaaaa gcagcgtaat caaggtttga 5880 catggaatat ttacctttgc caccaagtaa agaatgcaat gcattgtgta atatctccaa 5940 ctgaatctcg aaatcacaga agttatcctg ctccaaggcg taaaggacct gctcgtaaaa 6000 gtagtctgaa tggaagaggc tgtccttggg atccctagta gtgatttcat tctcgtgtgt 6060 gattttgcca tgatggaatg ggtttgtctc atagtgatgt tgcctggaat tgtagtaagt 6120 ggcgtctgaa atcaggtgag gtaaatgttc gattcgtttt gtccagtccc aatagggaac 6180 agctacagat gatccttttt tcaggagggc attctcaacg acaaccacgt acagtctgtg 6240 ccagtgaggg aaagttggca tgccatggac acaacaagag atggggttac catttaactg 6300 gcacaaacca gggctaccat ggaatttagc tattgattcg taagttccat cgttttcaat 6360 ttccaggaag gcagctctca gactttcgag ttcaccaagg ctcagatttt caatgttttt 6420 tcgaatacgg ttggcagaag ttacttcagc ttgatatact tcgccttcat ggtgacctat 6480 tcttggttca tgaatgactg aaggctgctt gaacaaatct tttcccaggc ttgctggttt 6540 aaggtcaaat acttcatagt caatgtagaa gtggtcttct gcatgcagct ttagatcgtg 6600 gagtttttcg gttatgtcat atttgaagac tctatcgtat gcccatggca tctcgttttc 6660 atcacccaga aggtaaaact ctccagcggg gtgacagttg ctgctatcgg tgcaaatatt 6720 gaatttaaca agagatgatt tcttgaaacc agataacagg aagcctgcaa agaacctgtc 6780 ttggctcttt atcgcttcga gctttttatt caactgagag attgagagac cgttaaattc 6840 caaagtgtca tattcataat tgaaattggt cttgtaatca aaaacgttga atggcactga 6900 atgctccttg gttacatggt ctgtgttgat attggcagag aggccaaatg gttgcaaagg 6960 ttctctcatc gatgtaacag cacagttagc aacgttgtac ggttttcctc tgtacttctg 7020 taaagcctgc catattgccc atatacgatc tgtattagag tgatgcaagt agaatagtgg 7080 atcaaaagca gtgtagcgaa gcgatgccat accataaggc tgagcgcctc ctacaagtgc 7140 atggatgtag ttatgggcaa tctcatattg gatttcaaag tcacagaaat tgtcctgttc 7200 caacgctagc agtacttgtt tggcaatgct tgtataatga ccaaactcag gctgttcgaa 7260 gagtttattc tgaacacttc ttgttgttgt tttgtcgact gtatctatat gaccatcgaa 7320 ccaagggtta tgcttctcta ccccatcttc agggtcgacg tacttctcat gatctgccag 7380 ttcgggaagg tgattaagag gagaggtcca atcccagtac ggcaaagctc catcgtagcc 7440 atgtcgtctc aaagcatttt cactctgaac cgtcaacagt cgatgccagt gagggaacac 7500 agacatgccg tgaacacagc aggcaaattt tttagatgct tcgggactag gacaccattt 7560 tggctcccca tggaatgcac caagctgctg aaatccaccg gcacttttgt cctctttcag 7620 taaggctagg gctttacgaa gttcattcat ctcctcgggg ctcaaacggt ctacattttt 7680 tcgaataaga cgactccttg gcactttgtc aaatttaaca cgtgcatgct cacgaatgac 7740 agaagcatgt ggaataagac tggatgcaag tttagttcca tcgacttcag tgatgtcaac 7800 agtaacttcg aaatcgccat cagcagtgag atcgagatct ttcaaaacgt gcgtgatgtc 7860 aaacttgaaa acgcgatcaa atccccactt catttccttg cttccaccga gcactgcaaa 7920 tgttccagcc ttatgttgca cggaatcggt agttttaatg aagatatcaa catttgctga 7980 agtacgtatg ccagccagga gaaaaccggc atatattctg tcttcttgct ggttttcgtg 8040 gatcatcttt tctatgtttt ccagagagat gcctccaaat gttaaatctt cgtatgtgta 8100 atggaggaca ttttcatagt cgtagatctt gtttggcatg gcattggcat gagtcttttc 8160 gttattgtta aggggagatg aaaaggcgaa tggtttcaga tgcatatgtt ccagagatat 8220 ggcgcagtgg gccctgtagg gtttatgccg tctcatctgt aaggcttgcc aaacggccca 8280 aagacgatca acgttagaat ggtgaaagta aaacaaagga tcgaaagctg tgtaatgtag 8340 ggacgacatt gagaaatgtt cgcttcctcc tgtccaagcg tgaatcgtgt tatgggtaat 8400 ctcaaattgg acttcaaagt cacagtaatc ttcttgttca aaggcataca aaatctggtt 8460 gaacaggtct gtgtggtgtc caaaggtagc gggtttaaag agtctttgat ctatttgtct 8520 tttggtgtgg ggagcatttt cttcaaatgc aatcactgaa ctatgaaaag gatttgtgtg 8580 ggatgcacca tgagaatcca cataagtttc gtttcctgcc agacccggaa tgtgtgtcat 8640 gggttcagtc cagtcccagt atgggatacc aatggtagct ccacggccta ctaaagcatc 8700 ttcaacttgt gtgacaaaaa gcctgtgcca gtgagggaag attggcattc catggacaca 8760 acaagcatat ctgtccttgg catcagggcg agggcatctg gccggaagac catgatactc 8820 agctgtggct tgataaccgt cgacggagcg atcttcttgg aatttatgaa acgcctctct 8880 cagattcaac acttcttcac gggtgagatc attgatgttc tttctaacga ggagattggc 8940 gctttggtga tgcttcaccg gaggatcagt gaaacccttc tccggatgat gaacaacaac 9000 tggatgaggt aacaaatcat cgggaagagc tgtaccattg acactgaaaa gctctgcctt 9060 gacggtatag tgtccatcca acttcattcc catcttgtga acagtgtcgg acaagtcaaa 9120 caagtatggc ctgtagaact gccatttcat ctcgcttggt ccgcccagaa cgaagaaatc 9180 accggccttc tcacagtggt catcattacg atcgttgtcg tcggggatgc agacgtacac 9240 aaagacattg gctgaacctc caaatccttt cagacggaag cttgcgaagg ctcttgccct 9300 tgaagaccgc tcatccaaat aggttttcag ttcctctgga gtcataccat taagattcaa 9360 cgtgtcatag tcgtagtgaa gttgacgata gtcaaagaga tcggctggag tggaatgttc 9420 atttgttagt ggtatgtcat tgtcctccca actgaaaggt tccattttct gatgtaacaa 9480 ttcttgggca caatccatag ctttggggtc tttgtttcgg agctcttgga gcctctgcca 9540 gatggcgaaa attctgtcaa cgttggagtg gtgaagaaag aagatcggat cgtaggcggt 9600 gtactcgagg tttgccatag agtaatcatg ctttccaccg accaggtagt gaatcgcatt 9660 atgtgctaac tcgaactgga tttcgaattt acagaattca tcctgctcca gtgcgtccaa 9720 aacactctcc ataagatggg tgtgctggcc tggcttcacc ttctcaaaaa gacggtcgtc 9780 gaccgcacga ttggttttct tgtctagaaa gtcaatgttc cctctgtacc agtagttggt 9840 gtgagccttt ccgccctggg aatcgatgta aacagggtcc gcagcgagag agggcaactg 9900 ggtgatgggt tctgtccagt cccagtatgg catcccagaa gtacgcctct tcgtctgcag 9960 agctcgctcc atgtgaacta cgtacgccct gtgccagtgc gggaaagtag gcatgccgtg 10020 aatacagcaa gctatagaag tgtccttatg cacacaagat gcgggcgcgc catgagcagc 10080 agctattttc tggaaaccaa tgctggatga gtccatctgc agctcacgca gggcagcctg 10140 caggtccagg gtctcttcct gagtcagatg ctccacactc tttctgacca ggttctctgc 10200 attggccaaa gccaacacga ctataagcaa cctgaccgac agcat 10245 30 3414 PRT Megathura crenulata SEQ ID NO30 is the deduced amino acid sequence of SEQ ID NO28 30 Met Leu Ser Val Arg Leu Leu Ile Val Val Leu Ala Leu Ala Asn Ala 1 5 10 15 Glu Asn Leu Val Arg Lys Ser Val Glu His Leu Thr Gln Glu Glu Thr 20 25 30 Leu Asp Leu Gln Ala Ala Leu Arg Glu Leu Gln Met Asp Ser Ser Ser 35 40 45 Ile Gly Phe Gln Lys Ile Ala Ala Ala His Gly Ala Pro Ala Ser Cys 50 55 60 Val His Lys Asp Thr Ser Ile Ala Cys Cys Ile His Gly Met Pro Thr 65 70 75 80 Phe Pro His Trp His Arg Ala Tyr Val Val His Met Glu Arg Ala Leu 85 90 95 Gln Thr Lys Arg Arg Thr Ser Gly Met Pro Tyr Trp Asp Trp Thr Glu 100 105 110 Pro Ile Thr Gln Leu Pro Ser Leu Ala Ala Asp Pro Val Tyr Ile Asp 115 120 125 Ser Gln Gly Gly Lys Ala His Thr Asn Tyr Trp Tyr Arg Gly Asn Ile 130 135 140 Asp Phe Leu Asp Lys Lys Thr Asn Arg Ala Val Asp Asp Arg Leu Phe 145 150 155 160 Glu Lys Val Lys Pro Gly Gln His Thr His Leu Met Glu Ser Val Leu 165 170 175 Asp Ala Leu Glu Gln Asp Glu Phe Cys Lys Phe Glu Ile Gln Phe Glu 180 185 190 Leu Ala His Asn Ala Ile His Tyr Leu Val Gly Gly Lys His Asp Tyr 195 200 205 Ser Met Ala Asn Leu Glu Tyr Thr Ala Tyr Asp Pro Ile Phe Phe Leu 210 215 220 His His Ser Asn Val Asp Arg Ile Phe Ala Ile Trp Gln Arg Leu Gln 225 230 235 240 Glu Leu Arg Asn Lys Asp Pro Lys Ala Met Asp Cys Ala Gln Glu Leu 245 250 255 Leu His Gln Lys Met Glu Pro Phe Ser Trp Glu Asp Asn Asp Ile Pro 260 265 270 Leu Thr Asn Glu His Ser Thr Pro Ala Asp Leu Phe Asp Tyr Arg Gln 275 280 285 Leu His Tyr Asp Tyr Asp Thr Leu Asn Leu Asn Gly Met Thr Pro Glu 290 295 300 Glu Leu Lys Thr Tyr Leu Asp Glu Arg Ser Ser Arg Ala Arg Ala Phe 305 310 315 320 Ala Ser Phe Arg Leu Lys Gly Phe Gly Gly Ser Ala Asn Val Phe Val 325 330 335 Tyr Val Cys Ile Pro Asp Asp Asn Asp Arg Asn Asp Asp His Cys Glu 340 345 350 Lys Ala Gly Asp Phe Phe Val Leu Gly Gly Pro Ser Glu Met Lys Trp 355 360 365 Gln Phe Tyr Arg Pro Tyr Leu Phe Asp Leu Ser Asp Thr Val His Lys 370 375 380 Met Gly Met Lys Leu Asp Gly His Tyr Thr Val Lys Ala Glu Leu Phe 385 390 395 400 Ser Val Asn Gly Thr Ala Leu Pro Asp Asp Leu Leu Pro His Pro Val 405 410 415 Val Val His His Pro Glu Lys Gly Phe Thr Asp Pro Pro Val Lys His 420 425 430 His Gln Ser Ala Asn Leu Leu Val Arg Lys Asn Ile Asn Asp Leu Thr 435 440 445 Arg Glu Glu Val Leu Asn Leu Arg Glu Ala Phe His Lys Phe Gln Glu 450 455 460 Asp Arg Ser Val Asp Gly Tyr Gln Ala Thr Ala Glu Tyr His Gly Leu 465 470 475 480 Pro Ala Arg Cys Pro Arg Pro Asp Ala Lys Asp Arg Tyr Ala Cys Cys 485 490 495 Val His Gly Met Pro Ile Phe Pro His Trp His Arg Leu Phe Val Thr 500 505 510 Gln Val Glu Asp Ala Leu Val Gly Arg Gly Ala Thr Ile Gly Ile Pro 515 520 525 Tyr Trp Asp Trp Thr Glu Pro Met Thr His Ile Pro Gly Leu Ala Gly 530 535 540 Asn Glu Thr Tyr Val Asp Ser His Gly Ala Ser His Thr Asn Pro Phe 545 550 555 560 His Ser Ser Val Ile Ala Phe Glu Glu Asn Ala Pro His Thr Lys Arg 565 570 575 Gln Ile Asp Gln Arg Leu Phe Lys Pro Ala Thr Phe Gly His His Thr 580 585 590 Asp Leu Phe Asn Gln Ile Leu Tyr Ala Phe Glu Gln Glu Asp Tyr Cys 595 600 605 Asp Phe Glu Val Gln Phe Glu Ile Thr His Asn Thr Ile His Ala Trp 610 615 620 Thr Gly Gly Ser Glu His Phe Ser Met Ser Ser Leu His Tyr Thr Ala 625 630 635 640 Phe Asp Pro Leu Phe Tyr Phe His His Ser Asn Val Asp Arg Leu Trp 645 650 655 Ala Val Trp Gln Ala Leu Gln Met Arg Arg His Lys Pro Tyr Arg Ala 660 665 670 His Cys Ala Ile Ser Leu Glu His Met His Leu Lys Pro Phe Ala Phe 675 680 685 Ser Ser Pro Leu Asn Asn Asn Glu Lys Thr His Ala Asn Ala Met Pro 690 695 700 Asn Lys Ile Tyr Asp Tyr Glu Asn Val Leu His Tyr Thr Tyr Glu Asp 705 710 715 720 Leu Thr Phe Gly Gly Ile Ser Leu Glu Asn Ile Glu Lys Met Ile His 725 730 735 Glu Asn Gln Gln Glu Asp Arg Ile Tyr Ala Gly Phe Leu Leu Ala Gly 740 745 750 Ile Arg Thr Ser Ala Asn Val Asp Ile Phe Ile Lys Thr Thr Asp Ser 755 760 765 Val Gln His Lys Ala Gly Thr Phe Ala Val Leu Gly Gly Ser Lys Glu 770 775 780 Met Lys Trp Gly Phe Asp Arg Val Phe Lys Phe Asp Ile Thr His Val 785 790 795 800 Leu Lys Asp Leu Asp Leu Thr Ala Asp Gly Asp Phe Glu Val Thr Val 805 810 815 Asp Ile Thr Glu Val Asp Gly Thr Lys Leu Ala Ser Ser Leu Ile Pro 820 825 830 His Ala Ser Val Ile Arg Glu His Ala Arg Val Lys Phe Asp Lys Val 835 840 845 Pro Arg Ser Arg Leu Ile Arg Lys Asn Val Asp Arg Leu Ser Pro Glu 850 855 860 Glu Met Asn Glu Leu Arg Lys Ala Leu Ala Leu Leu Lys Glu Asp Lys 865 870 875 880 Ser Ala Gly Gly Phe Gln Gln Leu Gly Ala Phe His Gly Glu Pro Lys 885 890 895 Trp Cys Pro Ser Pro Glu Ala Ser Lys Lys Phe Ala Cys Cys Val His 900 905 910 Gly Met Ser Val Phe Pro His Trp His Arg Leu Leu Thr Val Gln Ser 915 920 925 Glu Asn Ala Leu Arg Arg His Gly Tyr Asp Gly Ala Leu Pro Tyr Trp 930 935 940 Asp Trp Thr Ser Pro Leu Asn His Leu Pro Glu Leu Ala Asp His Glu 945 950 955 960 Lys Tyr Val Asp Pro Glu Asp Gly Val Glu Lys His Asn Pro Trp Phe 965 970 975 Asp Gly His Ile Asp Thr Val Asp Lys Thr Thr Thr Arg Ser Val Gln 980 985 990 Asn Lys Leu Phe Glu Gln Pro Glu Phe Gly His Tyr Thr Ser Ile Ala 995 1000 1005 Lys Gln Val Leu Leu Ala Leu Glu Gln Asp Asn Phe Cys Asp Phe 1010 1015 1020 Glu Ile Gln Tyr Glu Ile Ala His Asn Tyr Ile His Ala Leu Val 1025 1030 1035 Gly Gly Ala Gln Pro Tyr Gly Met Ala Ser Leu Arg Tyr Thr Ala 1040 1045 1050 Phe Asp Pro Leu Phe Tyr Leu His His Ser Asn Thr Asp Arg Ile 1055 1060 1065 Trp Ala Ile Trp Gln Ala Leu Gln Lys Tyr Arg Gly Lys Pro Tyr 1070 1075 1080 Asn Val Ala Asn Cys Ala Val Thr Ser Met Arg Glu Pro Leu Gln 1085 1090 1095 Pro Phe Gly Leu Ser Ala Asn Ile Asn Thr Asp His Val Thr Lys 1100 1105 1110 Glu His Ser Val Pro Phe Asn Val Phe Asp Tyr Lys Thr Asn Phe 1115 1120 1125 Asn Tyr Glu Tyr Asp Thr Leu Glu Phe Asn Gly Leu Ser Ile Ser 1130 1135 1140 Gln Leu Asn Lys Lys Leu Glu Ala Ile Lys Ser Gln Asp Arg Phe 1145 1150 1155 Phe Ala Gly Phe Leu Leu Ser Gly Phe Lys Lys Ser Ser Leu Val 1160 1165 1170 Lys Phe Asn Ile Cys Thr Asp Ser Ser Asn Cys His Pro Ala Gly 1175 1180 1185 Glu Phe Tyr Leu Leu Gly Asp Glu Asn Glu Met Pro Trp Ala Tyr 1190 1195 1200 Asp Arg Val Phe Lys Tyr Asp Ile Thr Glu Lys Leu His Asp Leu 1205 1210 1215 Lys Leu His Ala Glu Asp His Phe Tyr Ile Asp Tyr Glu Val Phe 1220 1225 1230 Asp Leu Lys Pro Ala Ser Leu Gly Lys Asp Leu Phe Lys Gln Pro 1235 1240 1245 Ser Val Ile His Glu Pro Arg Ile Gly His His Glu Gly Glu Val 1250 1255 1260 Tyr Gln Ala Glu Val Thr Ser Ala Asn Arg Ile Arg Lys Asn Ile 1265 1270 1275 Glu Asn Leu Ser Leu Gly Glu Leu Glu Ser Leu Arg Ala Ala Phe 1280 1285 1290 Leu Glu Ile Glu Asn Asp Gly Thr Tyr Glu Ser Ile Ala Lys Phe 1295 1300 1305 His Gly Ser Pro Gly Leu Cys Gln Leu Asn Gly Asn Pro Ile Ser 1310 1315 1320 Cys Cys Val His Gly Met Pro Thr Phe Pro His Trp His Arg Leu 1325 1330 1335 Tyr Val Val Val Val Glu Asn Ala Leu Leu Lys Lys Gly Ser Ser 1340 1345 1350 Val Ala Val Pro Tyr Trp Asp Trp Thr Lys Arg Ile Glu His Leu 1355 1360 1365 Pro His Leu Ile Ser Asp Ala Thr Tyr Tyr Asn Ser Arg Gln His 1370 1375 1380 His Tyr Glu Thr Asn Pro Phe His His Gly Lys Ile Thr His Glu 1385 1390 1395 Asn Glu Ile Thr Thr Arg Asp Pro Lys Asp Ser Leu Phe His Ser 1400 1405 1410 Asp Tyr Phe Tyr Glu Gln Val Leu Tyr Ala Leu Glu Gln Asp Asn 1415 1420 1425 Phe Cys Asp Phe Glu Ile Gln Leu Glu Ile Leu His Asn Ala Leu 1430 1435 1440 His Ser Leu Leu Gly Gly Lys Gly Lys Tyr Ser Met Ser Asn Leu 1445 1450 1455 Asp Tyr Ala Ala Phe Asp Pro Val Phe Phe Leu His His Ala Thr 1460 1465 1470 Thr Asp Arg Ile Trp Ala Ile Trp Gln Asp Leu Gln Arg Phe Arg 1475 1480 1485 Lys Arg Pro Tyr Arg Glu Ala Asn Cys Ala Ile Gln Leu Met His 1490 1495 1500 Thr Pro Leu Gln Pro Phe Asp Lys Ser Asp Asn Asn Asp Glu Ala 1505 1510 1515 Thr Lys Thr His Ala Thr Pro His Asp Gly Phe Glu Tyr Gln Asn 1520 1525 1530 Ser Phe Gly Tyr Ala Tyr Asp Asn Leu Glu Leu Asn His Tyr Ser 1535 1540 1545 Ile Pro Gln Leu Asp His Met Leu Gln Glu Arg Lys Arg His Asp 1550 1555 1560 Arg Val Phe Ala Gly Phe Leu Leu His Asn Ile Gly Thr Ser Ala 1565 1570 1575 Asp Val His Val Phe Val Cys Leu Pro Thr Gly Glu His Thr Lys 1580 1585 1590 Asp Cys Ser His Glu Ala Gly Met Phe Ser Ile Leu Gly Gly Gln 1595 1600 1605 Thr Glu Met Ser Phe Val Phe Asp Arg Leu Tyr Lys Leu Asp Ile 1610 1615 1620 Thr Lys Ala Leu Lys Lys Asn Gly Val His Leu Gln Gly Asp Phe 1625 1630 1635 Asp Leu Glu Ile Glu Ile Thr Ala Val Asn Gly Ser His Leu Asp 1640 1645 1650 Ser His Val Ile His Ser Pro Thr Ile Leu Phe Glu Ala Gly Thr 1655 1660 1665 Asp Ser Ala His Thr Asp Asp Gly His Thr Glu Pro Val Met Ile 1670 1675 1680 Arg Lys Asp Ile Thr Gln Leu Asp Lys Arg Gln Gln Leu Ser Leu 1685 1690 1695 Val Lys Ala Leu Glu Ser Met Lys Ala Asp His Ser Ser Asp Gly 1700 1705 1710 Phe Gln Ala Ile Ala Ser Phe His Ala Leu Pro Pro Leu Cys Pro 1715 1720 1725 Ser Pro Ala Ala Ser Lys Arg Phe Ala Cys Cys Val His Gly Met 1730 1735 1740 Ala Thr Phe Pro Gln Trp His Arg Leu Tyr Thr Val Gln Phe Gln 1745 1750 1755 Asp Ser Leu Arg Lys His Gly Ala Val Val Gly Leu Pro Tyr Trp 1760 1765 1770 Asp Trp Thr Leu Pro Arg Ser Glu Leu Pro Glu Leu Leu Thr Val 1775 1780 1785 Ser Thr Ile His Asp Pro Glu Thr Gly Arg Asp Ile Pro Asn Pro 1790 1795 1800 Phe Ile Gly Ser Lys Ile Glu Phe Glu Gly Glu Asn Val His Thr 1805 1810 1815 Lys Arg Asp Ile Asn Arg Asp Arg Leu Phe Gln Gly Ser Thr Lys 1820 1825 1830 Thr His His Asn Trp Phe Ile Glu Gln Ala Leu Leu Ala Leu Glu 1835 1840 1845 Gln Thr Asn Tyr Cys Asp Phe Glu Val Gln Phe Glu Ile Met His 1850 1855 1860 Asn Gly Val His Thr Trp Val Gly Gly Lys Glu Pro Tyr Gly Ile 1865 1870 1875 Gly His Leu His Tyr Ala Ser Tyr Asp Pro Leu Phe Tyr Ile His 1880 1885 1890 His Ser Gln Thr Asp Arg Ile Trp Ala Ile Trp Gln Ser Leu Gln 1895 1900 1905 Arg Phe Arg Gly Leu Ser Gly Ser Glu Ala Asn Cys Ala Val Asn 1910 1915 1920 Leu Met Lys Thr Pro Leu Lys Pro Phe Ser Phe Gly Ala Pro Tyr 1925 1930 1935 Asn Leu Asn Asp His Thr His Asp Phe Ser Lys Pro Glu Asp Thr 1940 1945 1950 Phe Asp Tyr Gln Lys Phe Gly Tyr Ile Tyr Asp Thr Leu Glu Phe 1955 1960 1965 Ala Gly Trp Ser Ile Arg Gly Ile Asp His Ile Val Arg Asn Arg 1970 1975 1980 Gln Glu His Ser Arg Val Phe Ala Gly Phe Leu Leu Glu Gly Phe 1985 1990 1995 Gly Thr Ser Ala Thr Val Asp Phe Gln Val Cys Arg Thr Ala Gly 2000 2005 2010 Asp Cys Glu Asp Ala Gly Tyr Phe Thr Val Leu Gly Gly Glu Lys 2015 2020 2025 Glu Met Pro Trp Ala Phe Asp Arg Leu Tyr Lys Tyr Asp Ile Thr 2030 2035 2040 Glu Thr Leu Asp Lys Met Asn Leu Arg His Asp Glu Ile Phe Gln 2045 2050 2055 Ile Glu Val Thr Ile Thr Ser Tyr Asp Gly Thr Val Leu Asp Ser 2060 2065 2070 Gly Leu Ile Pro Thr Pro Ser Ile Ile Tyr Asp Pro Ala His His 2075 2080 2085 Asp Ile Ser Ser His His Leu Ser Leu Asn Lys Val Arg His Asp 2090 2095 2100 Leu Ser Thr Leu Ser Glu Arg Asp Ile Gly Ser Leu Lys Tyr Ala 2105 2110 2115 Leu Ser Ser Leu Gln Ala Asp Thr Ser Ala Asp Gly Phe Ala Ala 2120 2125 2130 Ile Ala Ser Phe His Gly Leu Pro Ala Lys Cys Asn Asp Ser His 2135 2140 2145 Asn Asn Glu Val Ala Cys Cys Ile His Gly Met Pro Thr Phe Pro 2150 2155 2160 His Trp His Arg Leu Tyr Thr Leu Gln Phe Glu Gln Ala Leu Arg 2165 2170 2175 Arg His Gly Ser Ser Val Ala Val Pro Tyr Trp Asp Trp Thr Lys 2180 2185 2190 Pro Ile His Asn Ile Pro His Leu Phe Thr Asp Lys Glu Tyr Tyr 2195 2200 2205 Asp Val Trp Arg Asn Lys Val Met Pro Asn Pro Phe Ala Arg Gly 2210 2215 2220 Tyr Val Pro Ser His Asp Thr Tyr Thr Val Arg Asp Val Gln Glu 2225 2230 2235 Gly Leu Phe His Leu Thr Ser Thr Gly Glu His Ser Ala Leu Leu 2240 2245 2250 Asn Gln Ala Leu Leu Ala Leu Glu Gln His Asp Tyr Cys Asp Phe 2255 2260 2265 Ala Val Gln Phe Glu Val Met His Asn Thr Ile His Tyr Leu Val 2270 2275 2280 Gly Gly Pro Gln Val Tyr Ser Leu Ser Ser Leu His Tyr Ala Ser 2285 2290 2295 Tyr Asp Pro Ile Phe Phe Ile His His Ser Phe Val Asp Lys Val 2300 2305 2310 Trp Ala Val Trp Gln Ala Leu Gln Glu Lys Arg Gly Leu Pro Ser 2315 2320 2325 Asp Arg Ala Asp Cys Ala Val Ser Leu Met Thr Gln Asn Met Arg 2330 2335 2340 Pro Phe His His Glu Ile Asn His Asn Gln Phe Thr Lys Lys His 2345 2350 2355 Ala Val Pro Asn Asp Val Phe Lys Tyr Glu Leu Leu Gly Tyr Arg 2360 2365 2370 Tyr Asp Asn Leu Glu Ile Gly Gly Met Asn Leu His Glu Ile Glu 2375 2380 2385 Lys Glu Ile Lys Asp Lys Gln His His Val Arg Val Phe Ala Gly 2390 2395 2400 Phe Leu Leu His Gly Ile Arg Thr Ser Ala Asp Val Gln Phe Gln 2405 2410 2415 Ile Cys Lys Thr Ser Glu Asp Cys His His Gly Gly Gln Ile Phe 2420 2425 2430 Val Leu Gly Gly Thr Lys Glu Met Ala Trp Ala Tyr Asn Arg Leu 2435 2440 2445 Phe Lys Tyr Asp Ile Thr His Ala Leu His Asp Ala His Ile Thr 2450 2455 2460 Pro Glu Asp Val Phe His Pro Ser Glu Pro Phe Phe Ile Lys Val 2465 2470 2475 Ser Val Thr Ala Val Asn Gly Thr Val Leu Pro Ala Ser Ile Leu 2480 2485 2490 His Ala Pro Thr Ile Ile Tyr Glu Pro Gly Leu Asp His His Glu 2495 2500 2505 Asp His His Ser Ser Ser Met Ala Gly His Gly Val Arg Lys Glu 2510 2515 2520 Ile Asn Thr Leu Thr Thr Ala Glu Val Asp Asn Leu Lys Asp Ala 2525 2530 2535 Met Arg Ala Val Met Ala Asp His Gly Pro Asn Gly Tyr Gln Ala 2540 2545 2550 Ile Ala Ala Phe His Gly Asn Pro Pro Met Cys Pro Met Pro Asp 2555 2560 2565 Gly Lys Asn Tyr Ser Cys Cys Thr His Gly Met Ala Thr Phe Pro 2570 2575 2580 His Trp His Arg Leu Tyr Thr Lys Gln Met Glu Asp Ala Leu Thr 2585 2590 2595 Ala His Gly Ala Arg Val Gly Leu Pro Tyr Trp Asp Trp Thr Thr 2600 2605 2610 Ala Phe Thr Ala Leu Pro Thr Phe Val Thr Asp Glu Glu Asp Asn 2615 2620 2625 Pro Phe His His Gly His Ile Asp Tyr Leu Gly Val Asp Thr Thr 2630 2635 2640 Arg Ser Pro Arg Asp Lys Leu Phe Asn Asp Pro Glu Arg Gly Ser 2645 2650 2655 Glu Ser Phe Phe Tyr Arg Gln Val Leu Leu Ala Leu Glu Gln Thr 2660 2665 2670 Asp Phe Cys Gln Phe Glu Val Gln Phe Glu Ile Thr His Asn Ala 2675 2680 2685 Val His Ser Trp Thr Gly Gly Leu Thr Pro Tyr Gly Met Ser Thr 2690 2695 2700 Leu Glu Tyr Thr Thr Tyr Asp Pro Leu Phe Trp Leu His His Ala 2705 2710 2715 Asn Thr Asp Arg Ile Trp Ala Ile Trp Gln Ala Leu Gln Glu Tyr 2720 2725 2730 Arg Gly Leu Pro Tyr Asp His Ala Asn Cys Glu Ile Gln Ala Met 2735 2740 2745 Lys Arg Pro Leu Arg Pro Phe Ser Asp Pro Ile Asn His Asn Ala 2750 2755 2760 Phe Thr His Ser Asn Ala Lys Pro Thr Asp Val Phe Glu Tyr Ser 2765 2770 2775 Arg Phe Asn Phe Gln Tyr Asp Asn Leu Arg Phe His Gly Met Thr 2780 2785 2790 Ile Lys Lys Leu Glu His Glu Leu Glu Lys Gln Lys Glu Glu Asp 2795 2800 2805 Arg Thr Phe Ala Ala Phe Leu Leu His Gly Ile Lys Lys Ser Ala 2810 2815 2820 Asp Val Ser Phe Asp Val Cys Asn His Asp Gly Glu Cys His Phe 2825 2830 2835 Ala Gly Thr Phe Ala Ile Leu Gly Gly Glu His Glu Met Pro Trp 2840 2845 2850 Ser Phe Asp Arg Leu Phe Arg Tyr Asp Ile Thr Gln Val Leu Lys 2855 2860 2865 Gln Met His Leu Glu Tyr Asp Ser Asp Phe Thr Phe His Met Arg 2870 2875 2880 Ile Ile Asp Thr Ser Gly Lys Gln Leu Pro Ser Asp Leu Ile Lys 2885 2890 2895 Met Pro Thr Val Glu His Ser Pro Gly Gly Lys His His Glu Lys 2900 2905 2910 His His Glu Asp His His Glu Asp Ile Leu Val Arg Lys Asn Ile 2915 2920 2925 His Ser Leu Ser His His Glu Ala Glu Glu Leu Arg Asp Ala Leu 2930 2935 2940 Tyr Lys Leu Gln Asn Asp Glu Ser His Gly Gly Tyr Glu His Ile 2945 2950 2955 Ala Gly Phe His Gly Tyr Pro Asn Leu Cys Pro Glu Lys Gly Asp 2960 2965 2970 Glu Lys Tyr Pro Cys Cys Val His Gly Met Ser Ile Phe Pro His 2975 2980 2985 Trp His Arg Leu His Thr Ile Gln Phe Glu Arg Ala Leu Lys Lys 2990 2995 3000 His Gly Ser His Leu Gly Ile Pro Tyr Trp Asp Trp Thr Gln Thr 3005 3010 3015 Ile Ser Ser Leu Pro Thr Phe Phe Ala Asp Ser Gly Asn Asn Asn 3020 3025 3030 Pro Phe Phe Lys Tyr His Ile Arg Ser Ile Asn Gln Asp Thr Val 3035 3040 3045 Arg Asp Val Asn Glu Ala Ile Phe Gln Gln Thr Lys Phe Gly Glu 3050 3055 3060 Phe Ser Ser Ile Phe Tyr Leu Ala Leu Gln Ala Leu Glu Glu Asp 3065 3070 3075 Asn Tyr Cys Asp Phe Glu Val Gln Tyr Glu Ile Leu His Asn Glu 3080 3085 3090 Val His Ala Leu Ile Gly Gly Ala Glu Lys Tyr Ser Met Ser Thr 3095 3100 3105 Leu Glu Tyr Ser Ala Phe Asp Pro Tyr Phe Met Ile His His Ala 3110 3115 3120 Ser Leu Asp Lys Ile Trp Ile Ile Trp Gln Glu Leu Gln Lys Arg 3125 3130 3135 Arg Val Lys Pro Ala His Ala Gly Ser Cys Ala Gly Asp Ile Met 3140 3145 3150 His Val Pro Leu His Pro Phe Asn Tyr Glu Ser Val Asn Asn Asp 3155 3160 3165 Asp Phe Thr Arg Glu Asn Ser Leu Pro Asn Ala Val Val Asp Ser 3170 3175 3180 His Arg Phe Asn Tyr Lys Tyr Asp Asn Leu Asn Leu His Gly His 3185 3190 3195 Asn Ile Glu Glu Leu Glu Glu Val Leu Arg Ser Leu Arg Leu Lys 3200 3205 3210 Ser Arg Val Phe Ala Gly Phe Val Leu Ser Gly Ile Arg Thr Thr 3215 3220 3225 Ala Val Val Lys Val Tyr Ile Lys Ser Gly Thr Asp Ser Asp Asp 3230 3235 3240 Glu Tyr Ala Gly Ser Phe Val Ile Leu Gly Gly Ala Lys Glu Met 3245 3250 3255 Pro Trp Ala Tyr Glu Arg Leu Tyr Arg Phe Asp Ile Thr Glu Thr 3260 3265 3270 Val His Asn Leu Asn Leu Thr Asp Asp His Val Lys Phe Arg Phe 3275 3280 3285 Asp Leu Lys Lys Tyr Asp His Thr Glu Leu Asp Ala Ser Val Leu 3290 3295 3300 Pro Ala Pro Ile Ile Val Arg Arg Pro Asn Asn Ala Val Phe Asp 3305 3310 3315 Ile Ile Glu Ile Pro Ile Gly Lys Asp Val Asn Leu Pro Pro Lys 3320 3325 3330 Val Val Val Lys Arg Gly Thr Lys Ile Met Phe Met Ser Val Asp 3335 3340 3345 Glu Ala Val Thr Thr Pro Met Leu Asn Leu Gly Ser Tyr Thr Ala 3350 3355 3360 Met Phe Lys Cys Lys Val Pro Pro Phe Ser Phe His Ala Phe Glu 3365 3370 3375 Leu Gly Lys Met Tyr Ser Val Glu Ser Gly Asp Tyr Phe Met Thr 3380 3385 3390 Ala Ser Thr Thr Glu Leu Cys Asn Asp Asn Asn Leu Arg Ile His 3395 3400 3405 Val His Val Asp Asp Glu 3410 31 10949 DNA Megathura crenulata 31 ctggcgattt ggagaggcgc aatgctgtcg gtcaggttgc ttatagtcgt gttggctttg 60 gccaatgcag agaacctggt cagaaagagt gtggagcatc tgactcagga agagaccctg 120 gacctgcagg ctgccctgcg tgagctgcag atggactcat ccagcattgg tttccagaaa 180 atagctgctg ctcatggcgc gcccgcatct tgtgtgcata aggacacttc tatagcttgc 240 tgtattcacg gcatgcctac tttcccgcac tggcacaggg cgtacgtagt tcacatggag 300 cgagctctgc agacgaagag gcgtacttct gggatgccat actgggactg gacagaaccc 360 atcacccagt tgccctctct cgctgcggac cctgtttaca tcgattccca gggcggaaag 420 gctcacacca actactggta cagagggaac attgactttc tagacaagaa aaccaatcgt 480 gcggtcgacg accgtctttt tgagaaggtg aagccaggcc agcacaccca tcttatggag 540 agtgttttgg acgcactgga gcaggatgaa ttctgtaaat tcgaaatcca gttcgagtta 600 gcacataatg cgattcacta cctggtcggt ggaaagcatg attactctat ggcaaacctc 660 gagtacaccg cctacgatcc gatcttcttt cttcaccact ccaacgttga cagaattttc 720 gccatctggc agaggctcca agagctccga aacaaagacc ccaaagctat ggattgtgcc 780 caagaattgt tacatcagaa aatggaacct ttcagttggg aggacaatga cataccacta 840 acaaatgaac attccactcc agccgatctc tttgactatc gtcaacttca ctacgactat 900 gacacgttga atcttaatgg tatgactcca gaggaactga aaacctattt ggatgagcgg 960 tcttcaaggg caagagcctt cgcaagcttc cgtctgaaag gatttggagg ttcagccaat 1020 gtctttgtgt acgtctgcat ccccgacgac aacgatcgta atgatgacca ctgtgagaag 1080 gccggtgatt tcttcgttct gggcggacca agcgagatga aatggcagtt ctacaggcca 1140 tacttgtttg acttgtccga cactgttcac aagatgggaa tgaagttgga tggacactat 1200 accgtcaagg cagagctttt cagtgtcaat ggtacagctc ttcccgatga tttgttacct 1260 catccagttg ttgttcatca tccggagaag ggtttcactg atcctccggt gaagcatcac 1320 caaagcgcca atctcctcgt tagaaagaac atcaatgatc tcacccgtga agaagtgttg 1380 aatctgagag aggcgtttca taaattccaa gaagatcgct ccgtcgacgg ttatcaagcc 1440 acagctgagt atcatggtct tccggccaga tgccctcgcc ctgatgccaa ggacagatat 1500 gcttgttgtg tccatggaat gccaatcttc cctcactggc acaggctttt tgtcacacaa 1560 gttgaagatg ctttagtagg ccgtggagct accattggta tcccatactg ggactggact 1620 gaacccatga cacacattcc gggtctggca ggaaacgaaa cttatgtgga ttctcatggt 1680 gcatcccaca caaatccttt tcatagttca gtgattgcat ttgaagaaaa tgctccccac 1740 accaaaagac aaatagatca aagactcttt aaacccgcta cctttggaca ccacacagac 1800 ctgttcaacc agattttgta tgcctttgaa caagaagatt actgtgactt tgaagtccaa 1860 tttgagatta cccataacac gattcacgct tggacaggag gaagcgaaca tttctcaatg 1920 tcgtccctac attacacagc tttcgatcct ttgttttact ttcaccattc taacgttgat 1980 cgtctttggg ccgtttggca agccttacag atgagacggc ataaacccta cagggcccac 2040 tgcgccatat ctctggaaca tatgcatctg aaaccattcg ccttttcatc tccccttaac 2100 aataacgaaa agactcatgc caatgccatg ccaaacaaga tctacgacta tgaaaatgtc 2160 ctccattaca catacgaaga tttaacattt ggaggcatct ctctggaaaa catagaaaag 2220 atgatccacg aaaaccagca agaagacaga atatatgccg gttttctcct ggctggcata 2280 cgtacttcag caaatgttga tatcttcatt aaaactaccg attccgtgca acataaggct 2340 ggaacatttg cagtgctcgg tggaagcaag gaaatgaagt ggggatttga tcgcgttttc 2400 aagtttgaca tcacgcacgt tttgaaagat ctcgatctca ctgctgatgg cgatttcgaa 2460 gttactgttg acatcactga agtcgatgga actaaacttg catccagtct tattccacat 2520 gcttctgtca ttcgtgagca tgcacgtgtt aaatttgaca aagtgccaag gagtcgtctt 2580 attcgaaaaa atgtagaccg tttgagcccc gaggagatga atgaacttcg taaagcccta 2640 gccttactga aagaggacaa aagtgccggt ggatttcagc agcttggtgc attccatggg 2700 gagccaaaat ggtgtcctag tcccgaagca tctaaaaaat ttgcctgctg tgttcacggc 2760 atgtctgtgt tccctcactg gcatcgactg ttgacggttc agagtgaaaa tgctttgaga 2820 cgacatggct acgatggagc tttgccgtac tgggattgga cctctcctct taatcacctt 2880 cccgaactgg cagatcatga gaagtacgtc gaccctgaag atggggtaga gaagcataac 2940 ccttggttcg atggtcatat agatacagtc gacaaaacaa caacaagaag tgttcagaat 3000 aaactcttcg aacagcctga gtttggtcat tatacaagca ttgccaaaca agtactgcta 3060 gcgttggaac aggacaattt ctgtgacttt gaaatccaat atgagattgc ccataactac 3120 atccatgcac ttgtaggagg cgctcagcct tatggtatgg catcgcttcg ctacactgct 3180 tttgatccac tattctactt gcatcactct aatacagatc gtatatgggc aatatggcag 3240 gctttacaga agtacagagg aaaaccgtac aacgttgcta actgtgctgt tacatcgatg 3300 agagaacctt tgcaaccatt tggcctctct gccaatatca acacagacca tgtaaccaag 3360 gagcattcag tgccattcaa cgtttttgat tacaagacca atttcaatta tgaatatgac 3420 actttggaat ttaacggtct ctcaatctct cagttgaata aaaagctcga agcgataaag 3480 agccaagaca ggttctttgc aggcttcctg ttatctggtt tcaagaaatc atctcttgtt 3540 aaattcaata tttgcaccga tagcagcaac tgtcaccccg ctggagagtt ttaccttctg 3600 ggtgatgaaa acgagatgcc atgggcatac gatagagtct tcaaatatga cataaccgaa 3660 aaactccacg atctaaagct gcatgcagaa gaccacttct acattgacta tgaagtattt 3720 gaccttaaac cagcaagcct gggaaaagat ttgttcaagc agccttcagt cattcatgaa 3780 ccaagaatag gtcaccatga aggcgaagta tatcaagctg aagtaacttc tgccaaccgt 3840 attcgaaaaa acattgaaaa tctgagcctt ggtgaactcg aaagtctgag agctgccttc 3900 ctggaaattg aaaacgatgg aacttacgaa tcaatagcta aattccatgg tagccctggt 3960 ttgtgccagt taaatggtaa ccccatctct tgttgtgtcc atggcatgcc aactttccct 4020 cactggcaca gactgtacgt ggttgtcgtt gagaatgccc tcctgaaaaa aggatcatct 4080 gtagctgttc cctattggga ctggacaaaa cgaatcgaac atttacctca cctgatttca 4140 gacgccactt actacaattc caggcaacat cactatgaga caaacccatt ccatcatggc 4200 aaaatcacac acgagaatga aatcactact agggatccca aggacagcct cttccattca 4260 gactactttt acgagcaggt cctttacgcc ttggagcagg ataacttctg tgatttcgag 4320 attcagttgg agatattaca caatgcattg cattctttac ttggtggcaa aggtaaatat 4380 tccatgtcaa accttgatta cgctgctttt gatcctgtgt tcttccttca tcacgcaacg 4440 actgacagaa tctgggcaat ctggcaagac cttcagaggt tccgaaaacg gccataccga 4500 gaagcgaatt gcgctatcca attgatgcac acgccactcc agccgtttga taagagcgac 4560 aacaatgacg aggcaacgaa aacgcatgcc actccacatg atggttttga atatcaaaac 4620 agctttggtt atgcttacga taatctggaa ctgaatcact actcgattcc tcagcttgat 4680 cacatgctgc aagaaagaaa aaggcatgac agagtattcg ctggcttcct ccttcacaat 4740 attggaacat ctgccgatgt ccatgtattt gtatgtctcc caactgggga acacacgaag 4800 gactgcagtc atgaggctgg tatgttctcc atcttaggcg gtcaaacgga gatgtccttt 4860 gtatttgaca gactttacaa acttgacata actaaagcct tgaaaaagaa cggtgtgcac 4920 ctgcaagggg atttcgatct ggaaattgag attacggctg tgaatggatc tcatctagac 4980 agtcatgtca tccactctcc cactatactg tttgaggccg gaacagattc tgcccacaca 5040 gatgatggac acactgaacc agtgatgatt cgcaaagata tcacacaatt ggacaagcgt 5100 caacaactgt cactggtgaa agccctcgag tccatgaaag ccgaccattc atctgatggg 5160 ttccaggcaa tcgcttcctt ccatgctctt cctcctcttt gtccatcacc agctgcttca 5220 aagaggtttg cgtgctgcgt ccatggcatg gcaacgttcc cacaatggca ccgtctgtac 5280 acagtccaat tccaagattc tctcagaaaa catggtgcag tcgttggact tccgtactgg 5340 gactggaccc tacctcgttc tgaattacca gagctcctga ccgtctcaac tattcatgac 5400 ccggagacag gcagagatat accaaatcca tttattggtt ctaaaataga gtttgaagga 5460 gaaaacgtac atactaaaag agatatcaat agggatcgtc tcttccaggg atcaacaaaa 5520 acacatcata actggtttat tgagcaagca ctgcttgctc ttgaacaaac caactactgc 5580 gacttcgagg ttcagtttga aattatgcat aatggtgttc atacctgggt tggaggcaag 5640 gagccctatg gaattggcca tctgcattat gcttcctatg atccactttt ctacatccat 5700 cactcccaaa ctgatcgtat ttgggctata tggcaatcgt tgcagcgttt cagaggactt 5760 tctggatctg aggctaactg tgctgtaaat ctcatgaaaa ctcctctgaa gcctttcagc 5820 tttggagcac catataatct taatgatcac acgcatgatt tctcaaagcc tgaagataca 5880 ttcgactacc aaaagtttgg atacatatat gacactctgg aatttgcagg gtggtcaatt 5940 cgtggcattg accatattgt ccgtaacagg caggaacatt caagggtctt tgccggattc 6000 ttgcttgaag gatttggcac ctctgccact gtcgatttcc aggtctgtcg cacagcggga 6060 gactgtgaag atgcagggta cttcaccgtg ttgggaggtg aaaaagaaat gccttgggcc 6120 tttgatcggc tttacaagta cgacataaca gaaaccttag acaagatgaa ccttcgacat 6180 gacgaaatct tccagattga agtaaccatt acatcctacg atggaactgt actcgatagt 6240 ggccttattc ccacaccgtc aatcatctat gatcctgctc atcatgatat tagttcgcac 6300 cacctgtcgc tcaacaaggt tcgtcatgat ctgagtacac tgagtgagcg agatattgga 6360 agccttaaat atgctttgag cagcttgcag gcagatacct cagcagatgg ttttgctgcc 6420 attgcatcct tccatggtct gcctgccaaa tgtaatgaca gccacaataa cgaggtggca 6480 tgctgtatcc atggaatgcc tacattcccc cactggcaca gactctacac cctccaattt 6540 gagcaagctc taagaagaca tggctctagt gtagcagtac cctactggga ctggacaaag 6600 ccaatacata atattccaca tctgttcaca gacaaagaat actacgatgt ctggagaaat 6660 aaagtaatgc caaatccatt tgcccgaggg tatgtcccct cacacgatac atacacggta 6720 agagacgtcc aagaaggcct gttccacctg acatcaacgg gtgaacactc agcgcttctg 6780 aatcaagctc ttttggcgct ggaacagcac gactactgcg attttgcagt ccagtttgaa 6840 gtcatgcaca acacaatcca ttacctagtg ggaggacctc aagtctattc tttgtcatcc 6900 cttcattatg cttcatatga tccgatcttc ttcatacacc actcctttgt agacaaggtt 6960 tgggctgtct ggcaggctct tcaagaaaag agaggccttc catcagaccg tgctgactgc 7020 gctgttagtc tcatgactca gaacatgagg cctttccatc acgaaattaa ccataaccag 7080 ttcaccaaga aacatgcagt tccaaatgat gttttcaagt acgaactcct gggttacaga 7140 tacgacaatc tggaaatcgg tggcatgaat ttgcatgaaa ttgaaaagga aatcaaagac 7200 aaacagcacc atgtgagagt gtttgcaggg ttcctccttc acggaattag aacctcagct 7260 gatgtccaat tccagatttg taaaacatca gaagattgtc accatggagg ccaaatcttc 7320 gttcttgggg ggactaaaga gatggcctgg gcttataacc gtttattcaa gtacgatatt 7380 acccatgctc ttcatgacgc acacatcact ccagaagacg tattccatcc ctctgaacca 7440 ttcttcatca aggtgtcagt gacagccgtc aacggaacag ttcttccggc ttcaatcctg 7500 catgcaccaa ccattatcta tgaacctggt ctcgaccatc acgaagatca tcattcttct 7560 tctatggctg gacatggtgt cagaaaggaa atcaacacac ttaccactgc agaggtggac 7620 aatctcaaag atgccatgag agccgtcatg gcagaccacg gtccaaatgg ataccaggct 7680 atagcagcgt tccatggaaa cccaccaatg tgccctatgc cagatggaaa gaattactcg 7740 tgttgtacac atggcatggc tactttcccc cactggcaca gactgtacac aaaacagatg 7800 gaagatgcct tgaccgccca tggtgccaga gtcggccttc cttactggga ctggacaact 7860 gcctttacag ctttgccaac ttttgtcaca gatgaagagg acaatccttt ccatcatggt 7920 cacatagact atttgggagt ggatacaact cggtcgcccc gagacaagtt gttcaatgat 7980 ccagagcgag gatcagaatc gttcttctac aggcaggttc tcttggcttt ggagcagaca 8040 gatttttgtc agtttgaagt ccagttcgag attacccaca atgccgtcca ttcttggact 8100 ggtggactga ccccctacgg aatgtccact ctggagtaca cgacctatga tcctcttttc 8160 tggcttcatc atgccaatac tgaccgaata tgggctattt ggcaagctct tcaagaatac 8220 agaggtctcc catacgacca tgccaactgc gaaatccagg caatgaagag gcccctacgg 8280 ccattctctg atcctattaa tcacaatgcg ttcactcatt ctaacgcaaa accaacagac 8340 gttttcgagt acagtagatt caacttccag tatgataact tacggtttca tggtatgaca 8400 ataaaaaagt tggagcatga gcttgagaaa cagaaggaag aagacaggac attcgctgcc 8460 ttcctgctcc acggcattaa aaaaagtgct gatgtaagct ttgacgtgtg caaccatgat 8520 ggagaatgcc actttgcagg taccttcgct attcttggag gcgagcacga gatgccttgg 8580 tcctttgaca gacttttccg atatgacatc acccaagttc ttaagcaaat gcacctggag 8640 tatgattctg atttcacgtt ccatatgaga attattgaca cttcgggtaa acaactgcca 8700 tcagacctga tcaagatgcc aacggttgaa cacagcccag gaggaaaaca tcatgagaaa 8760 catcatgaag atcaccatga agacatactt gttcgtaaaa acattcactc tctgtcacac 8820 catgaagccg aagaattgag agatgcattg tacaagcttc aaaacgatga aagtcacggt 8880 ggatatgaac atattgctgg attccatggt tatcctaacc tctgtcctga aaaaggggat 8940 gaaaagtatc cttgctgtgt tcatggtatg tctatattcc cccactggca caggctccac 9000 acgatacagt ttgaaagagc tctgaagaaa catggttccc atctcggtat tccttactgg 9060 gattggactc agacgatatc gtctcttcct accttctttg cagattctgg taataataat 9120 ccattcttca aataccatat cagaagcata aatcaagaca ctgttcgaga cgttaatgaa 9180 gcgatattcc aacaaacgaa gtttggagaa ttttcgtcca tattttacct ggccttgcag 9240 gctctagaag aggataatta ttgcgatttt gaggtgcagt acgagatact tcacaatgag 9300 gtgcatgcct tgattggtgg agccgagaag tattccatgt ccacgctgga atattcagcc 9360 ttcgatccat acttcatgat ccaccacgcg tctttggaca agatctggat aatctggcag 9420 gagcttcaga aacgccgggt taagcctgca catgctggtt cgtgtgccgg agacattatg 9480 cacgttccac ttcatccttt caattatgaa agcgtcaaca acgatgactt taccagagag 9540 aattctctcc ccaacgcggt cgttgacagt catagattca actataaata tgacaatctt 9600 aatcttcacg gccacaatat agaggagctt gaggaggttc tccgcagcct gagacttaaa 9660 tctcgtgttt tcgcaggatt tgttctgtcc ggaatccgca caactgcagt tgtgaaagtg 9720 tatatcaaaa gcggaaccga ttccgacgac gaatatgccg gttcgtttgt catccttgga 9780 ggagccaaag agatgccttg ggcatatgag aggttgtata gatttgacat tacagaaact 9840 gtacacaatc ttaaccttac ggatgatcat gttaaattca ggtttgactt aaagaaatat 9900 gatcatactg aattggatgc atcagtactg ccagctccta taatcgttcg acgtccaaat 9960 aatgcggttt ttgatatcat tgagattcca atagggaaag acgtgaatct tccaccgaag 10020 gtggtcgtca agagaggcac taaaatcatg ttcatgtctg ttgatgaagc cgttaccact 10080 cccatgctga acctgggtag ttatacagca atgttcaaat gtaaagtgcc accattcagc 10140 ttccatgctt ttgagcttgg gaaaatgtat tctgtcgaat ctggtgacta tttcatgacg 10200 gcatcaacaa ctgaattatg caacgataat aacttaagga tacatgtcca tgtggatgac 10260 gaatagacaa aggtattctg atgcgttgag gatgtttctg aaacgtttgg aatatagtat 10320 agtatagtag atgagaagga atgggctcct gttatcaaac tcataataat tattttgata 10380 gccagatttc ctgttttgtt aatgaattac gaagacagtc tatgttcttt tcactgtaag 10440 acaggtctta cgttagtccc tttctaaaga ggtaacattg gcagctgagc cgtggggagg 10500 tggtgcggct gaaatcccct catttttacg ggatggggtt taaccctccc cctcccactt 10560 ttgaaaacga acgtaaattt tactcattcg aaatagctca gagcagatct tagacgatct 10620 acgaccccag accttcccat tcggtaatcg cttccgtcgc ccatggataa gatagttaat 10680 tgcaagaacg acttctctca tttggtttta agaagaaaag aatattcccc aaagtaaaag 10740 tatttgtatg agtgttaact cgccattggc ctgagagatg gctgttaatt gaaattaaaa 10800 tattgtaaat atttgtaact tcgagttaaa tggaatcttc gtgtaatgaa ttttttcctg 10860 caagacattg tatctcatgc agctcatatt tataattgtg tcaataaaat gtgtcatttg 10920 caaaaaaaaa aaaaaaaaaa aaaaaaaaa 10949 32 10949 DNA Megathura crenulata SEQ ID NO32 is the complementary sequence of the sequence of SEQ ID NO31 32 tttttttttt tttttttttt ttttttttgc aaatgacaca ttttattgac acaattataa 60 atatgagctg catgagatac aatgtcttgc aggaaaaaat tcattacacg aagattccat 120 ttaactcgaa gttacaaata tttacaatat tttaatttca attaacagcc atctctcagg 180 ccaatggcga gttaacactc atacaaatac ttttactttg gggaatattc ttttcttctt 240 aaaaccaaat gagagaagtc gttcttgcaa ttaactatct tatccatggg cgacggaagc 300 gattaccgaa tgggaaggtc tggggtcgta gatcgtctaa gatctgctct gagctatttc 360 gaatgagtaa aatttacgtt cgttttcaaa agtgggaggg ggagggttaa accccatccc 420 gtaaaaatga ggggatttca gccgcaccac ctccccacgg ctcagctgcc aatgttacct 480 ctttagaaag ggactaacgt aagacctgtc ttacagtgaa aagaacatag actgtcttcg 540 taattcatta acaaaacagg aaatctggct atcaaaataa ttattatgag tttgataaca 600 ggagcccatt ccttctcatc tactatacta tactatattc caaacgtttc agaaacatcc 660 tcaacgcatc agaatacctt tgtctattcg tcatccacat ggacatgtat ccttaagtta 720 ttatcgttgc ataattcagt tgttgatgcc gtcatgaaat agtcaccaga ttcgacagaa 780 tacattttcc caagctcaaa agcatggaag ctgaatggtg gcactttaca tttgaacatt 840 gctgtataac tacccaggtt cagcatggga gtggtaacgg cttcatcaac agacatgaac 900 atgattttag tgcctctctt gacgaccacc ttcggtggaa gattcacgtc tttccctatt 960 ggaatctcaa tgatatcaaa aaccgcatta tttggacgtc gaacgattat aggagctggc 1020 agtactgatg catccaattc agtatgatca tatttcttta agtcaaacct gaatttaaca 1080 tgatcatccg taaggttaag attgtgtaca gtttctgtaa tgtcaaatct atacaacctc 1140 tcatatgccc aaggcatctc tttggctcct ccaaggatga caaacgaacc ggcatattcg 1200 tcgtcggaat cggttccgct tttgatatac actttcacaa ctgcagttgt gcggattccg 1260 gacagaacaa atcctgcgaa aacacgagat ttaagtctca ggctgcggag aacctcctca 1320 agctcctcta tattgtggcc gtgaagatta agattgtcat atttatagtt gaatctatga 1380 ctgtcaacga ccgcgttggg gagagaattc tctctggtaa agtcatcgtt gttgacgctt 1440 tcataattga aaggatgaag tggaacgtgc ataatgtctc cggcacacga accagcatgt 1500 gcaggcttaa cccggcgttt ctgaagctcc tgccagatta tccagatctt gtccaaagac 1560 gcgtggtgga tcatgaagta tggatcgaag gctgaatatt ccagcgtgga catggaatac 1620 ttctcggctc caccaatcaa ggcatgcacc tcattgtgaa gtatctcgta ctgcacctca 1680 aaatcgcaat aattatcctc ttctagagcc tgcaaggcca ggtaaaatat ggacgaaaat 1740 tctccaaact tcgtttgttg gaatatcgct tcattaacgt ctcgaacagt gtcttgattt 1800 atgcttctga tatggtattt gaagaatgga ttattattac cagaatctgc aaagaaggta 1860 ggaagagacg atatcgtctg agtccaatcc cagtaaggaa taccgagatg ggaaccatgt 1920 ttcttcagag ctctttcaaa ctgtatcgtg tggagcctgt gccagtgggg gaatatagac 1980 ataccatgaa cacagcaagg atacttttca tccccttttt caggacagag gttaggataa 2040 ccatggaatc cagcaatatg ttcatatcca ccgtgacttt catcgttttg aagcttgtac 2100 aatgcatctc tcaattcttc ggcttcatgg tgtgacagag agtgaatgtt tttacgaaca 2160 agtatgtctt catggtgatc ttcatgatgt ttctcatgat gttttcctcc tgggctgtgt 2220 tcaaccgttg gcatcttgat caggtctgat ggcagttgtt tacccgaagt gtcaataatt 2280 ctcatatgga acgtgaaatc agaatcatac tccaggtgca tttgcttaag aacttgggtg 2340 atgtcatatc ggaaaagtct gtcaaaggac caaggcatct cgtgctcgcc tccaagaata 2400 gcgaaggtac ctgcaaagtg gcattctcca tcatggttgc acacgtcaaa gcttacatca 2460 gcactttttt taatgccgtg gagcaggaag gcagcgaatg tcctgtcttc ttccttctgt 2520 ttctcaagct catgctccaa cttttttatt gtcataccat gaaaccgtaa gttatcatac 2580 tggaagttga atctactgta ctcgaaaacg tctgttggtt ttgcgttaga atgagtgaac 2640 gcattgtgat taataggatc agagaatggc cgtaggggcc tcttcattgc ctggatttcg 2700 cagttggcat ggtcgtatgg gagacctctg tattcttgaa gagcttgcca aatagcccat 2760 attcggtcag tattggcatg atgaagccag aaaagaggat cataggtcgt gtactccaga 2820 gtggacattc cgtagggggt cagtccacca gtccaagaat ggacggcatt gtgggtaatc 2880 tcgaactgga cttcaaactg acaaaaatct gtctgctcca aagccaagag aacctgcctg 2940 tagaagaacg attctgatcc tcgctctgga tcattgaaca acttgtctcg gggcgaccga 3000 gttgtatcca ctcccaaata gtctatgtga ccatgatgga aaggattgtc ctcttcatct 3060 gtgacaaaag ttggcaaagc tgtaaaggca gttgtccagt cccagtaagg aaggccgact 3120 ctggcaccat gggcggtcaa ggcatcttcc atctgttttg tgtacagtct gtgccagtgg 3180 gggaaagtag ccatgccatg tgtacaacac gagtaattct ttccatctgg catagggcac 3240 attggtgggt ttccatggaa cgctgctata gcctggtatc catttggacc gtggtctgcc 3300 atgacggctc tcatggcatc tttgagattg tccacctctg cagtggtaag tgtgttgatt 3360 tcctttctga caccatgtcc agccatagaa gaagaatgat gatcttcgtg atggtcgaga 3420 ccaggttcat agataatggt tggtgcatgc aggattgaag ccggaagaac tgttccgttg 3480 acggctgtca ctgacacctt gatgaagaat ggttcagagg gatggaatac gtcttctgga 3540 gtgatgtgtg cgtcatgaag agcatgggta atatcgtact tgaataaacg gttataagcc 3600 caggccatct ctttagtccc cccaagaacg aagatttggc ctccatggtg acaatcttct 3660 gatgttttac aaatctggaa ttggacatca gctgaggttc taattccgtg aaggaggaac 3720 cctgcaaaca ctctcacatg gtgctgtttg tctttgattt ccttttcaat ttcatgcaaa 3780 ttcatgccac cgatttccag attgtcgtat ctgtaaccca ggagttcgta cttgaaaaca 3840 tcatttggaa ctgcatgttt cttggtgaac tggttatggt taatttcgtg atggaaaggc 3900 ctcatgttct gagtcatgag actaacagcg cagtcagcac ggtctgatgg aaggcctctc 3960 ttttcttgaa gagcctgcca gacagcccaa accttgtcta caaaggagtg gtgtatgaag 4020 aagatcggat catatgaagc ataatgaagg gatgacaaag aatagacttg aggtcctccc 4080 actaggtaat ggattgtgtt gtgcatgact tcaaactgga ctgcaaaatc gcagtagtcg 4140 tgctgttcca gcgccaaaag agcttgattc agaagcgctg agtgttcacc cgttgatgtc 4200 aggtggaaca ggccttcttg gacgtctctt accgtgtatg tatcgtgtga ggggacatac 4260 cctcgggcaa atggatttgg cattacttta tttctccaga catcgtagta ttctttgtct 4320 gtgaacagat gtggaatatt atgtattggc tttgtccagt cccagtaggg tactgctaca 4380 ctagagccat gtcttcttag agcttgctca aattggaggg tgtagagtct gtgccagtgg 4440 gggaatgtag gcattccatg gatacagcat gccacctcgt tattgtggct gtcattacat 4500 ttggcaggca gaccatggaa ggatgcaatg gcagcaaaac catctgctga ggtatctgcc 4560 tgcaagctgc tcaaagcata tttaaggctt ccaatatctc gctcactcag tgtactcaga 4620 tcatgacgaa ccttgttgag cgacaggtgg tgcgaactaa tatcatgatg agcaggatca 4680 tagatgattg acggtgtggg aataaggcca ctatcgagta cagttccatc gtaggatgta 4740 atggttactt caatctggaa gatttcgtca tgtcgaaggt tcatcttgtc taaggtttct 4800 gttatgtcgt acttgtaaag ccgatcaaag gcccaaggca tttctttttc acctcccaac 4860 acggtgaagt accctgcatc ttcacagtct cccgctgtgc gacagacctg gaaatcgaca 4920 gtggcagagg tgccaaatcc ttcaagcaag aatccggcaa agacccttga atgttcctgc 4980 ctgttacgga caatatggtc aatgccacga attgaccacc ctgcaaattc cagagtgtca 5040 tatatgtatc caaacttttg gtagtcgaat gtatcttcag gctttgagaa atcatgcgtg 5100 tgatcattaa gattatatgg tgctccaaag ctgaaaggct tcagaggagt tttcatgaga 5160 tttacagcac agttagcctc agatccagaa agtcctctga aacgctgcaa cgattgccat 5220 atagcccaaa tacgatcagt ttgggagtga tggatgtaga aaagtggatc ataggaagca 5280 taatgcagat ggccaattcc atagggctcc ttgcctccaa cccaggtatg aacaccatta 5340 tgcataattt caaactgaac ctcgaagtcg cagtagttgg tttgttcaag agcaagcagt 5400 gcttgctcaa taaaccagtt atgatgtgtt tttgttgatc cctggaagag acgatcccta 5460 ttgatatctc ttttagtatg tacgttttct ccttcaaact ctattttaga accaataaat 5520 ggatttggta tatctctgcc tgtctccggg tcatgaatag ttgagacggt caggagctct 5580 ggtaattcag aacgaggtag ggtccagtcc cagtacggaa gtccaacgac tgcaccatgt 5640 tttctgagag aatcttggaa ttggactgtg tacagacggt gccattgtgg gaacgttgcc 5700 atgccatgga cgcagcacgc aaacctcttt gaagcagctg gtgatggaca aagaggagga 5760 agagcatgga aggaagcgat tgcctggaac ccatcagatg aatggtcggc tttcatggac 5820 tcgagggctt tcaccagtga cagttgttga cgcttgtcca attgtgtgat atctttgcga 5880 atcatcactg gttcagtgtg tccatcatct gtgtgggcag aatctgttcc ggcctcaaac 5940 agtatagtgg gagagtggat gacatgactg tctagatgag atccattcac agccgtaatc 6000 tcaatttcca gatcgaaatc cccttgcagg tgcacaccgt tctttttcaa ggctttagtt 6060 atgtcaagtt tgtaaagtct gtcaaataca aaggacatct ccgtttgacc gcctaagatg 6120 gagaacatac cagcctcatg actgcagtcc ttcgtgtgtt ccccagttgg gagacataca 6180 aatacatgga catcggcaga tgttccaata ttgtgaagga ggaagccagc gaatactctg 6240 tcatgccttt ttctttcttg cagcatgtga tcaagctgag gaatcgagta gtgattcagt 6300 tccagattat cgtaagcata accaaagctg ttttgatatt caaaaccatc atgtggagtg 6360 gcatgcgttt tcgttgcctc gtcattgttg tcgctcttat caaacggctg gagtggcgtg 6420 tgcatcaatt ggatagcgca attcgcttct cggtatggcc gttttcggaa cctctgaagg 6480 tcttgccaga ttgcccagat tctgtcagtc gttgcgtgat gaaggaagaa cacaggatca 6540 aaagcagcgt aatcaaggtt tgacatggaa tatttacctt tgccaccaag taaagaatgc 6600 aatgcattgt gtaatatctc caactgaatc tcgaaatcac agaagttatc ctgctccaag 6660 gcgtaaagga cctgctcgta aaagtagtct gaatggaaga ggctgtcctt gggatcccta 6720 gtagtgattt cattctcgtg tgtgattttg ccatgatgga atgggtttgt ctcatagtga 6780 tgttgcctgg aattgtagta agtggcgtct gaaatcaggt gaggtaaatg ttcgattcgt 6840 tttgtccagt cccaataggg aacagctaca gatgatcctt ttttcaggag ggcattctca 6900 acgacaacca cgtacagtct gtgccagtga gggaaagttg gcatgccatg gacacaacaa 6960 gagatggggt taccatttaa ctggcacaaa ccagggctac catggaattt agctattgat 7020 tcgtaagttc catcgttttc aatttccagg aaggcagctc tcagactttc gagttcacca 7080 aggctcagat tttcaatgtt ttttcgaata cggttggcag aagttacttc agcttgatat 7140 acttcgcctt catggtgacc tattcttggt tcatgaatga ctgaaggctg cttgaacaaa 7200 tcttttccca ggcttgctgg tttaaggtca aatacttcat agtcaatgta gaagtggtct 7260 tctgcatgca gctttagatc gtggagtttt tcggttatgt catatttgaa gactctatcg 7320 tatgcccatg gcatctcgtt ttcatcaccc agaaggtaaa actctccagc ggggtgacag 7380 ttgctgctat cggtgcaaat attgaattta acaagagatg atttcttgaa accagataac 7440 aggaagcctg caaagaacct gtcttggctc tttatcgctt cgagcttttt attcaactga 7500 gagattgaga gaccgttaaa ttccaaagtg tcatattcat aattgaaatt ggtcttgtaa 7560 tcaaaaacgt tgaatggcac tgaatgctcc ttggttacat ggtctgtgtt gatattggca 7620 gagaggccaa atggttgcaa aggttctctc atcgatgtaa cagcacagtt agcaacgttg 7680 tacggttttc ctctgtactt ctgtaaagcc tgccatattg cccatatacg atctgtatta 7740 gagtgatgca agtagaatag tggatcaaaa gcagtgtagc gaagcgatgc cataccataa 7800 ggctgagcgc ctcctacaag tgcatggatg tagttatggg caatctcata ttggatttca 7860 aagtcacaga aattgtcctg ttccaacgct agcagtactt gtttggcaat gcttgtataa 7920 tgaccaaact caggctgttc gaagagttta ttctgaacac ttcttgttgt tgttttgtcg 7980 actgtatcta tatgaccatc gaaccaaggg ttatgcttct ctaccccatc ttcagggtcg 8040 acgtacttct catgatctgc cagttcggga aggtgattaa gaggagaggt ccaatcccag 8100 tacggcaaag ctccatcgta gccatgtcgt ctcaaagcat tttcactctg aaccgtcaac 8160 agtcgatgcc agtgagggaa cacagacatg ccgtgaacac agcaggcaaa ttttttagat 8220 gcttcgggac taggacacca ttttggctcc ccatggaatg caccaagctg ctgaaatcca 8280 ccggcacttt tgtcctcttt cagtaaggct agggctttac gaagttcatt catctcctcg 8340 gggctcaaac ggtctacatt ttttcgaata agacgactcc ttggcacttt gtcaaattta 8400 acacgtgcat gctcacgaat gacagaagca tgtggaataa gactggatgc aagtttagtt 8460 ccatcgactt cagtgatgtc aacagtaact tcgaaatcgc catcagcagt gagatcgaga 8520 tctttcaaaa cgtgcgtgat gtcaaacttg aaaacgcgat caaatcccca cttcatttcc 8580 ttgcttccac cgagcactgc aaatgttcca gccttatgtt gcacggaatc ggtagtttta 8640 atgaagatat caacatttgc tgaagtacgt atgccagcca ggagaaaacc ggcatatatt 8700 ctgtcttctt gctggttttc gtggatcatc ttttctatgt tttccagaga gatgcctcca 8760 aatgttaaat cttcgtatgt gtaatggagg acattttcat agtcgtagat cttgtttggc 8820 atggcattgg catgagtctt ttcgttattg ttaaggggag atgaaaaggc gaatggtttc 8880 agatgcatat gttccagaga tatggcgcag tgggccctgt agggtttatg ccgtctcatc 8940 tgtaaggctt gccaaacggc ccaaagacga tcaacgttag aatggtgaaa gtaaaacaaa 9000 ggatcgaaag ctgtgtaatg tagggacgac attgagaaat gttcgcttcc tcctgtccaa 9060 gcgtgaatcg tgttatgggt aatctcaaat tggacttcaa agtcacagta atcttcttgt 9120 tcaaaggcat acaaaatctg gttgaacagg tctgtgtggt gtccaaaggt agcgggttta 9180 aagagtcttt gatctatttg tcttttggtg tggggagcat tttcttcaaa tgcaatcact 9240 gaactatgaa aaggatttgt gtgggatgca ccatgagaat ccacataagt ttcgtttcct 9300 gccagacccg gaatgtgtgt catgggttca gtccagtccc agtatgggat accaatggta 9360 gctccacggc ctactaaagc atcttcaact tgtgtgacaa aaagcctgtg ccagtgaggg 9420 aagattggca ttccatggac acaacaagca tatctgtcct tggcatcagg gcgagggcat 9480 ctggccggaa gaccatgata ctcagctgtg gcttgataac cgtcgacgga gcgatcttct 9540 tggaatttat gaaacgcctc tctcagattc aacacttctt cacgggtgag atcattgatg 9600 ttctttctaa cgaggagatt ggcgctttgg tgatgcttca ccggaggatc agtgaaaccc 9660 ttctccggat gatgaacaac aactggatga ggtaacaaat catcgggaag agctgtacca 9720 ttgacactga aaagctctgc cttgacggta tagtgtccat ccaacttcat tcccatcttg 9780 tgaacagtgt cggacaagtc aaacaagtat ggcctgtaga actgccattt catctcgctt 9840 ggtccgccca gaacgaagaa atcaccggcc ttctcacagt ggtcatcatt acgatcgttg 9900 tcgtcgggga tgcagacgta cacaaagaca ttggctgaac ctccaaatcc tttcagacgg 9960 aagcttgcga aggctcttgc ccttgaagac cgctcatcca aataggtttt cagttcctct 10020 ggagtcatac cattaagatt caacgtgtca tagtcgtagt gaagttgacg atagtcaaag 10080 agatcggctg gagtggaatg ttcatttgtt agtggtatgt cattgtcctc ccaactgaaa 10140 ggttccattt tctgatgtaa caattcttgg gcacaatcca tagctttggg gtctttgttt 10200 cggagctctt ggagcctctg ccagatggcg aaaattctgt caacgttgga gtggtgaaga 10260 aagaagatcg gatcgtaggc ggtgtactcg aggtttgcca tagagtaatc atgctttcca 10320 ccgaccaggt agtgaatcgc attatgtgct aactcgaact ggatttcgaa tttacagaat 10380 tcatcctgct ccagtgcgtc caaaacactc tccataagat gggtgtgctg gcctggcttc 10440 accttctcaa aaagacggtc gtcgaccgca cgattggttt tcttgtctag aaagtcaatg 10500 ttccctctgt accagtagtt ggtgtgagcc tttccgccct gggaatcgat gtaaacaggg 10560 tccgcagcga gagagggcaa ctgggtgatg ggttctgtcc agtcccagta tggcatccca 10620 gaagtacgcc tcttcgtctg cagagctcgc tccatgtgaa ctacgtacgc cctgtgccag 10680 tgcgggaaag taggcatgcc gtgaatacag caagctatag aagtgtcctt atgcacacaa 10740 gatgcgggcg cgccatgagc agcagctatt ttctggaaac caatgctgga tgagtccatc 10800 tgcagctcac gcagggcagc ctgcaggtcc agggtctctt cctgagtcag atgctccaca 10860 ctctttctga ccaggttctc tgcattggcc aaagccaaca cgactataag caacctgacc 10920 gacagcattg cgcctctcca aatcgccag 10949 33 3414 PRT Megathura crenulata SEQ ID NO33 is the deduced amino acid sequence of the coding region of SEQ ID NO31 33 Met Leu Ser Val Arg Leu Leu Ile Val Val Leu Ala Leu Ala Asn Ala 1 5 10 15 Glu Asn Leu Val Arg Lys Ser Val Glu His Leu Thr Gln Glu Glu Thr 20 25 30 Leu Asp Leu Gln Ala Ala Leu Arg Glu Leu Gln Met Asp Ser Ser Ser 35 40 45 Ile Gly Phe Gln Lys Ile Ala Ala Ala His Gly Ala Pro Ala Ser Cys 50 55 60 Val His Lys Asp Thr Ser Ile Ala Cys Cys Ile His Gly Met Pro Thr 65 70 75 80 Phe Pro His Trp His Arg Ala Tyr Val Val His Met Glu Arg Ala Leu 85 90 95 Gln Thr Lys Arg Arg Thr Ser Gly Met Pro Tyr Trp Asp Trp Thr Glu 100 105 110 Pro Ile Thr Gln Leu Pro Ser Leu Ala Ala Asp Pro Val Tyr Ile Asp 115 120 125 Ser Gln Gly Gly Lys Ala His Thr Asn Tyr Trp Tyr Arg Gly Asn Ile 130 135 140 Asp Phe Leu Asp Lys Lys Thr Asn Arg Ala Val Asp Asp Arg Leu Phe 145 150 155 160 Glu Lys Val Lys Pro Gly Gln His Thr His Leu Met Glu Ser Val Leu 165 170 175 Asp Ala Leu Glu Gln Asp Glu Phe Cys Lys Phe Glu Ile Gln Phe Glu 180 185 190 Leu Ala His Asn Ala Ile His Tyr Leu Val Gly Gly Lys His Asp Tyr 195 200 205 Ser Met Ala Asn Leu Glu Tyr Thr Ala Tyr Asp Pro Ile Phe Phe Leu 210 215 220 His His Ser Asn Val Asp Arg Ile Phe Ala Ile Trp Gln Arg Leu Gln 225 230 235 240 Glu Leu Arg Asn Lys Asp Pro Lys Ala Met Asp Cys Ala Gln Glu Leu 245 250 255 Leu His Gln Lys Met Glu Pro Phe Ser Trp Glu Asp Asn Asp Ile Pro 260 265 270 Leu Thr Asn Glu His Ser Thr Pro Ala Asp Leu Phe Asp Tyr Arg Gln 275 280 285 Leu His Tyr Asp Tyr Asp Thr Leu Asn Leu Asn Gly Met Thr Pro Glu 290 295 300 Glu Leu Lys Thr Tyr Leu Asp Glu Arg Ser Ser Arg Ala Arg Ala Phe 305 310 315 320 Ala Ser Phe Arg Leu Lys Gly Phe Gly Gly Ser Ala Asn Val Phe Val 325 330 335 Tyr Val Cys Ile Pro Asp Asp Asn Asp Arg Asn Asp Asp His Cys Glu 340 345 350 Lys Ala Gly Asp Phe Phe Val Leu Gly Gly Pro Ser Glu Met Lys Trp 355 360 365 Gln Phe Tyr Arg Pro Tyr Leu Phe Asp Leu Ser Asp Thr Val His Lys 370 375 380 Met Gly Met Lys Leu Asp Gly His Tyr Thr Val Lys Ala Glu Leu Phe 385 390 395 400 Ser Val Asn Gly Thr Ala Leu Pro Asp Asp Leu Leu Pro His Pro Val 405 410 415 Val Val His His Pro Glu Lys Gly Phe Thr Asp Pro Pro Val Lys His 420 425 430 His Gln Ser Ala Asn Leu Leu Val Arg Lys Asn Ile Asn Asp Leu Thr 435 440 445 Arg Glu Glu Val Leu Asn Leu Arg Glu Ala Phe His Lys Phe Gln Glu 450 455 460 Asp Arg Ser Val Asp Gly Tyr Gln Ala Thr Ala Glu Tyr His Gly Leu 465 470 475 480 Pro Ala Arg Cys Pro Arg Pro Asp Ala Lys Asp Arg Tyr Ala Cys Cys 485 490 495 Val His Gly Met Pro Ile Phe Pro His Trp His Arg Leu Phe Val Thr 500 505 510 Gln Val Glu Asp Ala Leu Val Gly Arg Gly Ala Thr Ile Gly Ile Pro 515 520 525 Tyr Trp Asp Trp Thr Glu Pro Met Thr His Ile Pro Gly Leu Ala Gly 530 535 540 Asn Glu Thr Tyr Val Asp Ser His Gly Ala Ser His Thr Asn Pro Phe 545 550 555 560 His Ser Ser Val Ile Ala Phe Glu Glu Asn Ala Pro His Thr Lys Arg 565 570 575 Gln Ile Asp Gln Arg Leu Phe Lys Pro Ala Thr Phe Gly His His Thr 580 585 590 Asp Leu Phe Asn Gln Ile Leu Tyr Ala Phe Glu Gln Glu Asp Tyr Cys 595 600 605 Asp Phe Glu Val Gln Phe Glu Ile Thr His Asn Thr Ile His Ala Trp 610 615 620 Thr Gly Gly Ser Glu His Phe Ser Met Ser Ser Leu His Tyr Thr Ala 625 630 635 640 Phe Asp Pro Leu Phe Tyr Phe His His Ser Asn Val Asp Arg Leu Trp 645 650 655 Ala Val Trp Gln Ala Leu Gln Met Arg Arg His Lys Pro Tyr Arg Ala 660 665 670 His Cys Ala Ile Ser Leu Glu His Met His Leu Lys Pro Phe Ala Phe 675 680 685 Ser Ser Pro Leu Asn Asn Asn Glu Lys Thr His Ala Asn Ala Met Pro 690 695 700 Asn Lys Ile Tyr Asp Tyr Glu Asn Val Leu His Tyr Thr Tyr Glu Asp 705 710 715 720 Leu Thr Phe Gly Gly Ile Ser Leu Glu Asn Ile Glu Lys Met Ile His 725 730 735 Glu Asn Gln Gln Glu Asp Arg Ile Tyr Ala Gly Phe Leu Leu Ala Gly 740 745 750 Ile Arg Thr Ser Ala Asn Val Asp Ile Phe Ile Lys Thr Thr Asp Ser 755 760 765 Val Gln His Lys Ala Gly Thr Phe Ala Val Leu Gly Gly Ser Lys Glu 770 775 780 Met Lys Trp Gly Phe Asp Arg Val Phe Lys Phe Asp Ile Thr His Val 785 790 795 800 Leu Lys Asp Leu Asp Leu Thr Ala Asp Gly Asp Phe Glu Val Thr Val 805 810 815 Asp Ile Thr Glu Val Asp Gly Thr Lys Leu Ala Ser Ser Leu Ile Pro 820 825 830 His Ala Ser Val Ile Arg Glu His Ala Arg Val Lys Phe Asp Lys Val 835 840 845 Pro Arg Ser Arg Leu Ile Arg Lys Asn Val Asp Arg Leu Ser Pro Glu 850 855 860 Glu Met Asn Glu Leu Arg Lys Ala Leu Ala Leu Leu Lys Glu Asp Lys 865 870 875 880 Ser Ala Gly Gly Phe Gln Gln Leu Gly Ala Phe His Gly Glu Pro Lys 885 890 895 Trp Cys Pro Ser Pro Glu Ala Ser Lys Lys Phe Ala Cys Cys Val His 900 905 910 Gly Met Ser Val Phe Pro His Trp His Arg Leu Leu Thr Val Gln Ser 915 920 925 Glu Asn Ala Leu Arg Arg His Gly Tyr Asp Gly Ala Leu Pro Tyr Trp 930 935 940 Asp Trp Thr Ser Pro Leu Asn His Leu Pro Glu Leu Ala Asp His Glu 945 950 955 960 Lys Tyr Val Asp Pro Glu Asp Gly Val Glu Lys His Asn Pro Trp Phe 965 970 975 Asp Gly His Ile Asp Thr Val Asp Lys Thr Thr Thr Arg Ser Val Gln 980 985 990 Asn Lys Leu Phe Glu Gln Pro Glu Phe Gly His Tyr Thr Ser Ile Ala 995 1000 1005 Lys Gln Val Leu Leu Ala Leu Glu Gln Asp Asn Phe Cys Asp Phe 1010 1015 1020 Glu Ile Gln Tyr Glu Ile Ala His Asn Tyr Ile His Ala Leu Val 1025 1030 1035 Gly Gly Ala Gln Pro Tyr Gly Met Ala Ser Leu Arg Tyr Thr Ala 1040 1045 1050 Phe Asp Pro Leu Phe Tyr Leu His His Ser Asn Thr Asp Arg Ile 1055 1060 1065 Trp Ala Ile Trp Gln Ala Leu Gln Lys Tyr Arg Gly Lys Pro Tyr 1070 1075 1080 Asn Val Ala Asn Cys Ala Val Thr Ser Met Arg Glu Pro Leu Gln 1085 1090 1095 Pro Phe Gly Leu Ser Ala Asn Ile Asn Thr Asp His Val Thr Lys 1100 1105 1110 Glu His Ser Val Pro Phe Asn Val Phe Asp Tyr Lys Thr Asn Phe 1115 1120 1125 Asn Tyr Glu Tyr Asp Thr Leu Glu Phe Asn Gly Leu Ser Ile Ser 1130 1135 1140 Gln Leu Asn Lys Lys Leu Glu Ala Ile Lys Ser Gln Asp Arg Phe 1145 1150 1155 Phe Ala Gly Phe Leu Leu Ser Gly Phe Lys Lys Ser Ser Leu Val 1160 1165 1170 Lys Phe Asn Ile Cys Thr Asp Ser Ser Asn Cys His Pro Ala Gly 1175 1180 1185 Glu Phe Tyr Leu Leu Gly Asp Glu Asn Glu Met Pro Trp Ala Tyr 1190 1195 1200 Asp Arg Val Phe Lys Tyr Asp Ile Thr Glu Lys Leu His Asp Leu 1205 1210 1215 Lys Leu His Ala Glu Asp His Phe Tyr Ile Asp Tyr Glu Val Phe 1220 1225 1230 Asp Leu Lys Pro Ala Ser Leu Gly Lys Asp Leu Phe Lys Gln Pro 1235 1240 1245 Ser Val Ile His Glu Pro Arg Ile Gly His His Glu Gly Glu Val 1250 1255 1260 Tyr Gln Ala Glu Val Thr Ser Ala Asn Arg Ile Arg Lys Asn Ile 1265 1270 1275 Glu Asn Leu Ser Leu Gly Glu Leu Glu Ser Leu Arg Ala Ala Phe 1280 1285 1290 Leu Glu Ile Glu Asn Asp Gly Thr Tyr Glu Ser Ile Ala Lys Phe 1295 1300 1305 His Gly Ser Pro Gly Leu Cys Gln Leu Asn Gly Asn Pro Ile Ser 1310 1315 1320 Cys Cys Val His Gly Met Pro Thr Phe Pro His Trp His Arg Leu 1325 1330 1335 Tyr Val Val Val Val Glu Asn Ala Leu Leu Lys Lys Gly Ser Ser 1340 1345 1350 Val Ala Val Pro Tyr Trp Asp Trp Thr Lys Arg Ile Glu His Leu 1355 1360 1365 Pro His Leu Ile Ser Asp Ala Thr Tyr Tyr Asn Ser Arg Gln His 1370 1375 1380 His Tyr Glu Thr Asn Pro Phe His His Gly Lys Ile Thr His Glu 1385 1390 1395 Asn Glu Ile Thr Thr Arg Asp Pro Lys Asp Ser Leu Phe His Ser 1400 1405 1410 Asp Tyr Phe Tyr Glu Gln Val Leu Tyr Ala Leu Glu Gln Asp Asn 1415 1420 1425 Phe Cys Asp Phe Glu Ile Gln Leu Glu Ile Leu His Asn Ala Leu 1430 1435 1440 His Ser Leu Leu Gly Gly Lys Gly Lys Tyr Ser Met Ser Asn Leu 1445 1450 1455 Asp Tyr Ala Ala Phe Asp Pro Val Phe Phe Leu His His Ala Thr 1460 1465 1470 Thr Asp Arg Ile Trp Ala Ile Trp Gln Asp Leu Gln Arg Phe Arg 1475 1480 1485 Lys Arg Pro Tyr Arg Glu Ala Asn Cys Ala Ile Gln Leu Met His 1490 1495 1500 Thr Pro Leu Gln Pro Phe Asp Lys Ser Asp Asn Asn Asp Glu Ala 1505 1510 1515 Thr Lys Thr His Ala Thr Pro His Asp Gly Phe Glu Tyr Gln Asn 1520 1525 1530 Ser Phe Gly Tyr Ala Tyr Asp Asn Leu Glu Leu Asn His Tyr Ser 1535 1540 1545 Ile Pro Gln Leu Asp His Met Leu Gln Glu Arg Lys Arg His Asp 1550 1555 1560 Arg Val Phe Ala Gly Phe Leu Leu His Asn Ile Gly Thr Ser Ala 1565 1570 1575 Asp Val His Val Phe Val Cys Leu Pro Thr Gly Glu His Thr Lys 1580 1585 1590 Asp Cys Ser His Glu Ala Gly Met Phe Ser Ile Leu Gly Gly Gln 1595 1600 1605 Thr Glu Met Ser Phe Val Phe Asp Arg Leu Tyr Lys Leu Asp Ile 1610 1615 1620 Thr Lys Ala Leu Lys Lys Asn Gly Val His Leu Gln Gly Asp Phe 1625 1630 1635 Asp Leu Glu Ile Glu Ile Thr Ala Val Asn Gly Ser His Leu Asp 1640 1645 1650 Ser His Val Ile His Ser Pro Thr Ile Leu Phe Glu Ala Gly Thr 1655 1660 1665 Asp Ser Ala His Thr Asp Asp Gly His Thr Glu Pro Val Met Ile 1670 1675 1680 Arg Lys Asp Ile Thr Gln Leu Asp Lys Arg Gln Gln Leu Ser Leu 1685 1690 1695 Val Lys Ala Leu Glu Ser Met Lys Ala Asp His Ser Ser Asp Gly 1700 1705 1710 Phe Gln Ala Ile Ala Ser Phe His Ala Leu Pro Pro Leu Cys Pro 1715 1720 1725 Ser Pro Ala Ala Ser Lys Arg Phe Ala Cys Cys Val His Gly Met 1730 1735 1740 Ala Thr Phe Pro Gln Trp His Arg Leu Tyr Thr Val Gln Phe Gln 1745 1750 1755 Asp Ser Leu Arg Lys His Gly Ala Val Val Gly Leu Pro Tyr Trp 1760 1765 1770 Asp Trp Thr Leu Pro Arg Ser Glu Leu Pro Glu Leu Leu Thr Val 1775 1780 1785 Ser Thr Ile His Asp Pro Glu Thr Gly Arg Asp Ile Pro Asn Pro 1790 1795 1800 Phe Ile Gly Ser Lys Ile Glu Phe Glu Gly Glu Asn Val His Thr 1805 1810 1815 Lys Arg Asp Ile Asn Arg Asp Arg Leu Phe Gln Gly Ser Thr Lys 1820 1825 1830 Thr His His Asn Trp Phe Ile Glu Gln Ala Leu Leu Ala Leu Glu 1835 1840 1845 Gln Thr Asn Tyr Cys Asp Phe Glu Val Gln Phe Glu Ile Met His 1850 1855 1860 Asn Gly Val His Thr Trp Val Gly Gly Lys Glu Pro Tyr Gly Ile 1865 1870 1875 Gly His Leu His Tyr Ala Ser Tyr Asp Pro Leu Phe Tyr Ile His 1880 1885 1890 His Ser Gln Thr Asp Arg Ile Trp Ala Ile Trp Gln Ser Leu Gln 1895 1900 1905 Arg Phe Arg Gly Leu Ser Gly Ser Glu Ala Asn Cys Ala Val Asn 1910 1915 1920 Leu Met Lys Thr Pro Leu Lys Pro Phe Ser Phe Gly Ala Pro Tyr 1925 1930 1935 Asn Leu Asn Asp His Thr His Asp Phe Ser Lys Pro Glu Asp Thr 1940 1945 1950 Phe Asp Tyr Gln Lys Phe Gly Tyr Ile Tyr Asp Thr Leu Glu Phe 1955 1960 1965 Ala Gly Trp Ser Ile Arg Gly Ile Asp His Ile Val Arg Asn Arg 1970 1975 1980 Gln Glu His Ser Arg Val Phe Ala Gly Phe Leu Leu Glu Gly Phe 1985 1990 1995 Gly Thr Ser Ala Thr Val Asp Phe Gln Val Cys Arg Thr Ala Gly 2000 2005 2010 Asp Cys Glu Asp Ala Gly Tyr Phe Thr Val Leu Gly Gly Glu Lys 2015 2020 2025 Glu Met Pro Trp Ala Phe Asp Arg Leu Tyr Lys Tyr Asp Ile Thr 2030 2035 2040 Glu Thr Leu Asp Lys Met Asn Leu Arg His Asp Glu Ile Phe Gln 2045 2050 2055 Ile Glu Val Thr Ile Thr Ser Tyr Asp Gly Thr Val Leu Asp Ser 2060 2065 2070 Gly Leu Ile Pro Thr Pro Ser Ile Ile Tyr Asp Pro Ala His His 2075 2080 2085 Asp Ile Ser Ser His His Leu Ser Leu Asn Lys Val Arg His Asp 2090 2095 2100 Leu Ser Thr Leu Ser Glu Arg Asp Ile Gly Ser Leu Lys Tyr Ala 2105 2110 2115 Leu Ser Ser Leu Gln Ala Asp Thr Ser Ala Asp Gly Phe Ala Ala 2120 2125 2130 Ile Ala Ser Phe His Gly Leu Pro Ala Lys Cys Asn Asp Ser His 2135 2140 2145 Asn Asn Glu Val Ala Cys Cys Ile His Gly Met Pro Thr Phe Pro 2150 2155 2160 His Trp His Arg Leu Tyr Thr Leu Gln Phe Glu Gln Ala Leu Arg 2165 2170 2175 Arg His Gly Ser Ser Val Ala Val Pro Tyr Trp Asp Trp Thr Lys 2180 2185 2190 Pro Ile His Asn Ile Pro His Leu Phe Thr Asp Lys Glu Tyr Tyr 2195 2200 2205 Asp Val Trp Arg Asn Lys Val Met Pro Asn Pro Phe Ala Arg Gly 2210 2215 2220 Tyr Val Pro Ser His Asp Thr Tyr Thr Val Arg Asp Val Gln Glu 2225 2230 2235 Gly Leu Phe His Leu Thr Ser Thr Gly Glu His Ser Ala Leu Leu 2240 2245 2250 Asn Gln Ala Leu Leu Ala Leu Glu Gln His Asp Tyr Cys Asp Phe 2255 2260 2265 Ala Val Gln Phe Glu Val Met His Asn Thr Ile His Tyr Leu Val 2270 2275 2280 Gly Gly Pro Gln Val Tyr Ser Leu Ser Ser Leu His Tyr Ala Ser 2285 2290 2295 Tyr Asp Pro Ile Phe Phe Ile His His Ser Phe Val Asp Lys Val 2300 2305 2310 Trp Ala Val Trp Gln Ala Leu Gln Glu Lys Arg Gly Leu Pro Ser 2315 2320 2325 Asp Arg Ala Asp Cys Ala Val Ser Leu Met Thr Gln Asn Met Arg 2330 2335 2340 Pro Phe His His Glu Ile Asn His Asn Gln Phe Thr Lys Lys His 2345 2350 2355 Ala Val Pro Asn Asp Val Phe Lys Tyr Glu Leu Leu Gly Tyr Arg 2360 2365 2370 Tyr Asp Asn Leu Glu Ile Gly Gly Met Asn Leu His Glu Ile Glu 2375 2380 2385 Lys Glu Ile Lys Asp Lys Gln His His Val Arg Val Phe Ala Gly 2390 2395 2400 Phe Leu Leu His Gly Ile Arg Thr Ser Ala Asp Val Gln Phe Gln 2405 2410 2415 Ile Cys Lys Thr Ser Glu Asp Cys His His Gly Gly Gln Ile Phe 2420 2425 2430 Val Leu Gly Gly Thr Lys Glu Met Ala Trp Ala Tyr Asn Arg Leu 2435 2440 2445 Phe Lys Tyr Asp Ile Thr His Ala Leu His Asp Ala His Ile Thr 2450 2455 2460 Pro Glu Asp Val Phe His Pro Ser Glu Pro Phe Phe Ile Lys Val 2465 2470 2475 Ser Val Thr Ala Val Asn Gly Thr Val Leu Pro Ala Ser Ile Leu 2480 2485 2490 His Ala Pro Thr Ile Ile Tyr Glu Pro Gly Leu Asp His His Glu 2495 2500 2505 Asp His His Ser Ser Ser Met Ala Gly His Gly Val Arg Lys Glu 2510 2515 2520 Ile Asn Thr Leu Thr Thr Ala Glu Val Asp Asn Leu Lys Asp Ala 2525 2530 2535 Met Arg Ala Val Met Ala Asp His Gly Pro Asn Gly Tyr Gln Ala 2540 2545 2550 Ile Ala Ala Phe His Gly Asn Pro Pro Met Cys Pro Met Pro Asp 2555 2560 2565 Gly Lys Asn Tyr Ser Cys Cys Thr His Gly Met Ala Thr Phe Pro 2570 2575 2580 His Trp His Arg Leu Tyr Thr Lys Gln Met Glu Asp Ala Leu Thr 2585 2590 2595 Ala His Gly Ala Arg Val Gly Leu Pro Tyr Trp Asp Trp Thr Thr 2600 2605 2610 Ala Phe Thr Ala Leu Pro Thr Phe Val Thr Asp Glu Glu Asp Asn 2615 2620 2625 Pro Phe His His Gly His Ile Asp Tyr Leu Gly Val Asp Thr Thr 2630 2635 2640 Arg Ser Pro Arg Asp Lys Leu Phe Asn Asp Pro Glu Arg Gly Ser 2645 2650 2655 Glu Ser Phe Phe Tyr Arg Gln Val Leu Leu Ala Leu Glu Gln Thr 2660 2665 2670 Asp Phe Cys Gln Phe Glu Val Gln Phe Glu Ile Thr His Asn Ala 2675 2680 2685 Val His Ser Trp Thr Gly Gly Leu Thr Pro Tyr Gly Met Ser Thr 2690 2695 2700 Leu Glu Tyr Thr Thr Tyr Asp Pro Leu Phe Trp Leu His His Ala 2705 2710 2715 Asn Thr Asp Arg Ile Trp Ala Ile Trp Gln Ala Leu Gln Glu Tyr 2720 2725 2730 Arg Gly Leu Pro Tyr Asp His Ala Asn Cys Glu Ile Gln Ala Met 2735 2740 2745 Lys Arg Pro Leu Arg Pro Phe Ser Asp Pro Ile Asn His Asn Ala 2750 2755 2760 Phe Thr His Ser Asn Ala Lys Pro Thr Asp Val Phe Glu Tyr Ser 2765 2770 2775 Arg Phe Asn Phe Gln Tyr Asp Asn Leu Arg Phe His Gly Met Thr 2780 2785 2790 Ile Lys Lys Leu Glu His Glu Leu Glu Lys Gln Lys Glu Glu Asp 2795 2800 2805 Arg Thr Phe Ala Ala Phe Leu Leu His Gly Ile Lys Lys Ser Ala 2810 2815 2820 Asp Val Ser Phe Asp Val Cys Asn His Asp Gly Glu Cys His Phe 2825 2830 2835 Ala Gly Thr Phe Ala Ile Leu Gly Gly Glu His Glu Met Pro Trp 2840 2845 2850 Ser Phe Asp Arg Leu Phe Arg Tyr Asp Ile Thr Gln Val Leu Lys 2855 2860 2865 Gln Met His Leu Glu Tyr Asp Ser Asp Phe Thr Phe His Met Arg 2870 2875 2880 Ile Ile Asp Thr Ser Gly Lys Gln Leu Pro Ser Asp Leu Ile Lys 2885 2890 2895 Met Pro Thr Val Glu His Ser Pro Gly Gly Lys His His Glu Lys 2900 2905 2910 His His Glu Asp His His Glu Asp Ile Leu Val Arg Lys Asn Ile 2915 2920 2925 His Ser Leu Ser His His Glu Ala Glu Glu Leu Arg Asp Ala Leu 2930 2935 2940 Tyr Lys Leu Gln Asn Asp Glu Ser His Gly Gly Tyr Glu His Ile 2945 2950 2955 Ala Gly Phe His Gly Tyr Pro Asn Leu Cys Pro Glu Lys Gly Asp 2960 2965 2970 Glu Lys Tyr Pro Cys Cys Val His Gly Met Ser Ile Phe Pro His 2975 2980 2985 Trp His Arg Leu His Thr Ile Gln Phe Glu Arg Ala Leu Lys Lys 2990 2995 3000 His Gly Ser His Leu Gly Ile Pro Tyr Trp Asp Trp Thr Gln Thr 3005 3010 3015 Ile Ser Ser Leu Pro Thr Phe Phe Ala Asp Ser Gly Asn Asn Asn 3020 3025 3030 Pro Phe Phe Lys Tyr His Ile Arg Ser Ile Asn Gln Asp Thr Val 3035 3040 3045 Arg Asp Val Asn Glu Ala Ile Phe Gln Gln Thr Lys Phe Gly Glu 3050 3055 3060 Phe Ser Ser Ile Phe Tyr Leu Ala Leu Gln Ala Leu Glu Glu Asp 3065 3070 3075 Asn Tyr Cys Asp Phe Glu Val Gln Tyr Glu Ile Leu His Asn Glu 3080 3085 3090 Val His Ala Leu Ile Gly Gly Ala Glu Lys Tyr Ser Met Ser Thr 3095 3100 3105 Leu Glu Tyr Ser Ala Phe Asp Pro Tyr Phe Met Ile His His Ala 3110 3115 3120 Ser Leu Asp Lys Ile Trp Ile Ile Trp Gln Glu Leu Gln Lys Arg 3125 3130 3135 Arg Val Lys Pro Ala His Ala Gly Ser Cys Ala Gly Asp Ile Met 3140 3145 3150 His Val Pro Leu His Pro Phe Asn Tyr Glu Ser Val Asn Asn Asp 3155 3160 3165 Asp Phe Thr Arg Glu Asn Ser Leu Pro Asn Ala Val Val Asp Ser 3170 3175 3180 His Arg Phe Asn Tyr Lys Tyr Asp Asn Leu Asn Leu His Gly His 3185 3190 3195 Asn Ile Glu Glu Leu Glu Glu Val Leu Arg Ser Leu Arg Leu Lys 3200 3205 3210 Ser Arg Val Phe Ala Gly Phe Val Leu Ser Gly Ile Arg Thr Thr 3215 3220 3225 Ala Val Val Lys Val Tyr Ile Lys Ser Gly Thr Asp Ser Asp Asp 3230 3235 3240 Glu Tyr Ala Gly Ser Phe Val Ile Leu Gly Gly Ala Lys Glu Met 3245 3250 3255 Pro Trp Ala Tyr Glu Arg Leu Tyr Arg Phe Asp Ile Thr Glu Thr 3260 3265 3270 Val His Asn Leu Asn Leu Thr Asp Asp His Val Lys Phe Arg Phe 3275 3280 3285 Asp Leu Lys Lys Tyr Asp His Thr Glu Leu Asp Ala Ser Val Leu 3290 3295 3300 Pro Ala Pro Ile Ile Val Arg Arg Pro Asn Asn Ala Val Phe Asp 3305 3310 3315 Ile Ile Glu Ile Pro Ile Gly Lys Asp Val Asn Leu Pro Pro Lys 3320 3325 3330 Val Val Val Lys Arg Gly Thr Lys Ile Met Phe Met Ser Val Asp 3335 3340 3345 Glu Ala Val Thr Thr Pro Met Leu Asn Leu Gly Ser Tyr Thr Ala 3350 3355 3360 Met Phe Lys Cys Lys Val Pro Pro Phe Ser Phe His Ala Phe Glu 3365 3370 3375 Leu Gly Lys Met Tyr Ser Val Glu Ser Gly Asp Tyr Phe Met Thr 3380 3385 3390 Ala Ser Thr Thr Glu Leu Cys Asn Asp Asn Asn Leu Arg Ile His 3395 3400 3405 Val His Val Asp Asp Glu 3410 34 28 DNA Artificial 5′ RACE Primer 1 34 cggggtgaca gttgctgcta tcggtgca 28 35 24 DNA Artificial 3′ RACE Primer 2 35 gccttgaccg cccatggtgc caga 24 36 27 DNA Artificial PCR Primer 3 36 ggggcgaccg agttgtatcc actccca 27 37 27 DNA Artificial PCR Primer 4 37 ggctggaaca tttgcagtgc tcggtgg 27 38 49 DNA Artificial KLH2 Primer 1 (forward) 5′ to 3′ 38 aggctacgcg ttcgaaggag atagaaccat gtggaccatc ttggctctc 49 39 38 DNA Artificial KLH2 Primer 2 (reverse) 5′ to 3′ 39 ctggtgcggc cgccttactc gtcttcaatg tggatatg 38 40 28 DNA Artificial KLH1 Primer 1 (forward) 5′ to 3′ 40 atgctgtcgg tcaggttgct tatagtcg 28 41 29 DNA Artificial KLH1 Primer 2 (reverse) 5′ to 3′ 41 ctattcgtca tccacatgga catgtatcc 29 

1. An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding keyhole limpet hemocyanin (KLH)-2 or a fragment thereof comprising at least 1,080 contiguous nucleotides.
 2. The isolated or purified nucleic acid molecule of claim 1, which (i) encodes the amino acid sequence of SEQ ID NO: 3 or at least 360 contiguous amino acids of SEQ ID NO: 3, (ii) consists essentially of the nucleotide sequence of SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides and is characterized by no more than 40% mismatch, or (iv) shares 60% or more identity with SEQ ID NO:
 1. 3. The isolated or purified nucleic acid molecule of claim 2, which hybridizes under highly stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides and is characterized by no more than 20% mismatch.
 4. An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence that is complementary to a nucleotide sequence encoding KLH-2 or a fragment thereof comprising at least 1,080 contiguous nucleotides.
 5. The isolated or purified nucleic acid molecule of claim 4, which (i) is complementary to a nucleotide sequence encoding the amino acid sequence of SEQ ID NO: 3 or at least 360 contiguous amino acids of SEQ ID NO: 3, (ii) is complementary to the nucleotide sequence of SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of SEQ ID NO: 1 or a fragment thereof comprising at least 1,080 contiguous nucleotides and is characterized by no more than 40% mismatch, or (iv) shares 60% or more identity with the nucleotide sequence that is complementary to SEQ ID NO:
 1. 6. An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-1 or a fragment thereof comprising at least 2,300 contiguous nucleotides.
 7. The isolated or purified nucleic acid molecule of claim 6, which (i) encodes the amino acid sequence of SEQ ID NO: 24 or at least 770 contiguous amino acids of SEQ ID NO: 24, (ii) consists essentially of the nucleotide sequence of SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides and is characterized by no more than 35% mismatch, or (iv) shares 65% or more identity with SEQ ID NO:
 22. 8. The isolated or purified nucleic acid of claim 7, which hybridizes under highly stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of the nucleotide sequence that is complementary to SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides and is characterized by no more than 20% mismatch.
 9. An isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence that is complementary to a nucleotide sequence encoding KLH-1 or a fragment thereof comprising at least 2,300 contiguous nucleotides.
 10. The isolated or purified nucleic acid molecule of claim 9, which (i) is complementary to a nucleotide sequence encoding the amino acid sequence of SEQ ID NO: 24 or at least 770 contiguous amino acids of SEQ ID NO: 24, (ii) is complementary to the nucleotide sequence of SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides, (iii) hybridizes under moderately stringent conditions to an isolated or purified nucleic acid molecule consisting essentially of SEQ ID NO: 22 or a fragment thereof comprising at least 2,300 contiguous nucleotides and is characterized by no more than 35% mismatch, or (iv) shares 65% or more identity with the nucleotide sequence that is complementary to SEQ ID NO:
 22. 11. A vector comprising the isolated or purified nucleic acid molecule of claim 1, optionally as part of an encoded fusion protein.
 12. A vector comprising the isolated or purified nucleic acid molecule of claim
 4. 13. A vector comprising the isolated or purified nucleic acid molecule of any of claim 6, optionally as part of an encoded fusion protein.
 14. A vector comprising the isolated or purified nucleic acid molecule of claim
 9. 15. A cell comprising and expressing the isolated or purified nucleic acid molecule of claim 1, optionally in the form of a vector.
 16. A cell comprising and expressing the isolated or purified nucleic acid molecule of claim 4, optionally in the form of a vector.
 17. A cell comprising and expressing the isolated or purified nucleic acid molecule of any of claim 6, optionally in the form of a vector.
 18. A cell comprising and expressing the isolated or purified nucleic acid molecule of claim 9, optionally in the form of a vector.
 19. An isolated or purified polypeptide molecule consisting essentially of an amino acid sequence of KLH-2 or at least 360 contiguous amino acids of KLH-2, which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt and/or dimerized or polymerized, wherein the isolated or purified polypeptide molecule is essentially free from KLH-1.
 20. A conjugate or fusion protein comprising the isolated or purified polypeptide molecule of claim 19 and a therapeutically or prophylactically active agent.
 21. The conjugate or fusion protein of claim 20, wherein the active agent is an immunogen, a hapten, an allergen, an antigen of a pathogen, an antigen of a cancer, hemocyanin β-adrenergic receptor peptide, or a sperm coat protein.
 22. A composition comprising the isolated or purified polypeptide molecule of claim 19, optionally in the form of a conjugate or a fusion protein comprising a therapeutically or prophylactically active agent, and an excipient or an adjuvant.
 23. An isolated or purified polypeptide molecule consisting essentially of an amino acid sequence of KLH-1 or at least 770 contiguous amino acids of KLH-1, which is optionally glycosylated, amidated, carboxylated, phosphorylated, esterified, N-acylated or converted into an acid addition salt and/or dimerized or polymerized, wherein the isolated or purified polypeptide molecule is essentially free from KLH-2.
 24. A conjugate or fusion protein comprising the isolated or purified polypeptide molecule of claim 23 and a therapeutically or prophylactically active agent.
 25. The conjugate or fusion protein of claim 24, wherein the active agent is an immunogen, a hapten, an allergen, an antigen of a pathogen, an antigen of a cancer, hemocyanin β-adrenergic receptor peptide, or a sperm coat protein.
 26. A composition comprising the isolated or purified polypeptide molecule of claim 23, optionally in the form of a conjugate or a fusion protein comprising a therapeutically or prophylactically active agent, and an excipient or an adjuvant.
 27. An isolated or purified nucleic acid molecule consisting essentially of nucleotides 1-1,260 of SEQ ID NO: 22, nucleotides 1-1,266 of SEQ ID NO: 1, nucleotides 8,695-10,197 of SEQ ID NO: 22, nucleotides 8,713-10,206 of SEQ ID NO: 1, nucleotides 1,261-1,527 of SEQ ID NO: 22, nucleotides 1,267-1,530 of SEQ ID NO: 1, or nucleotides 7,975-8,694 of SEQ ID NO:
 22. 28. An isolated or purified polypeptide molecule consisting essentially of amino acids 1-420 of SEQ ID NO: 24, amino acids 1-422 of SEQ ID NO: 3, amino acids 2,899-3,398 of SEQ ID NO: 24, amino acids 2,905-3,401 of SEQ ID NO: 3, amino acids 421-509 of SEQ ID NO: 24, amino acids 423-510 of SEQ ID NO: 3, or amino acids 2,659-2,898 of SEQ ID NO:
 24. 29. A method of treating cancer prophylactically or therapeutically in a mammal, which method comprises administering to the mammal an effective amount of: (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the cancer, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector, (b) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the cancer, (c) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the cancer or an anti-cancer agent, or (d) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the cancer and/or an anti-cancer agent, whereupon the mammal is treated for the cancer prophylactically or therapeutically.
 30. The method of claim 29, wherein the cancer is adenocarcinoma, bladder cancer, colon cancer, breast cancer, lung cancer or skin cancer.
 31. The method of claim 30, wherein the skin cancer is melanoma.
 32. A method of treating a mammal prophylactically or therapeutically for a pathogenic infection, which method comprises administering to the mammal an effective amount of: (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the pathogen, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector, (b) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen of the pathogen, (c) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the pathogen or an anti-pathogen agent, or (d) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen of the pathogen and/or an anti-pathogen agent, whereupon the mammal is treated for the pathogenic infection prophylactically or therapeutically.
 33. A method of stimulating an immune response in a mammal, which method comprises administering to the mammal an effective amount of: (a) an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-2 or an immunogenic fragment thereof comprising at least 1,080 contiguous nucleotides, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector, (b) an isolated or purified nucleic acid molecule consisting essentially of a nucleotide sequence encoding KLH-1 or an immunogenic fragment thereof comprising at least 2,300 contiguous nucleotides, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector, (c) an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (d) an isolated or purified polypeptide molecule consisting essentially of an amino acid sequence encoding KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, whereupon an immune response in the mammal is stimulated.
 34. The method of claim 33, in which the immune response in the mammal that is stimulated results in the inhibition of tyrosinase.
 35. A method of stimulating an immune response to an antigen in a mammal, which method comprises administering to the mammal an effective amount of dendritic cells which have been previously isolated from the mammal and treated in vitro with: (a) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) an antigen, (b) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen, or (c) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) an antigen, whereupon an immune response to the antigen is induced in the mammal.
 36. The method of claim 35, wherein the antigen is an antigen of a pathogen or an antigen of a cancer.
 37. The method of claim 36, wherein the method further comprises the administration of an anti-pathogen agent or an anti-cancer agent.
 38. A method of treating high blood pressure in a mammal, which method comprises administering to the mammal an effective amount of: (a) an isolated or purified nucleic acid molecule encoding a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) hemocyanin β-adrenergic receptor peptide, wherein the isolated or purified nucleic acid molecule is optionally in the form of a vector, (b) a fusion protein comprising (i) KLH or an immunogenic fragment thereof, and (ii) hemocyanin β-adrenergic receptor peptide, (c) a conjugate comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) hemocyanin β-adrenergic receptor peptide, or (d) a composition comprising (i) (a) KLH-2 or an immunogenic fragment thereof comprising at least 360 contiguous amino acids, wherein the KLH-2 is essentially free from KLH-1, or (b) KLH-1 or an immunogenic fragment thereof comprising at least 770 contiguous amino acids, wherein the KLH-1 is essentially free from KLH-2, and (ii) hemocyanin β-adrenergic receptor peptide, whereupon the mammal is treated for high blood pressure prophylactically or therapeutically.
 39. An isolated or purified nucleic acid molecule consisting essentially of SEQ ID NO: 4 or encoding SEQ ID NO:
 6. 40. An isolated or purified nucleic acid molecule consisting essentially of the isolated or purified nucleic acid molecule of claim 39 coupled to an isolated or purified nucleic acid molecule encoding a polypeptide or protein, optionally in the form of a vector, which, upon expression in a eukaryotic cell, results in the secretion of the encoded polypeptide or protein by the eukaryotic cell.
 41. A eukaryotic cell comprising and expressing the isolated or purified nucleic acid molecule of claim
 40. 42. An isolated or purified nucleic acid molecule consisting essentially of SEQ ID NO: 25 or encoding SEQ ID NO:
 27. 43. An isolated or purified nucleic acid molecule consisting essentially of the isolated or purified nucleic acid molecule of claim 42 coupled to an isolated or purified nucleic acid molecule encoding a polypeptide or protein, optionally in the form of a vector, which, upon expression in a eukaryotic cell, results in the secretion of the encoded polypeptide or protein by the eukaryotic cell.
 44. A eukaryotic cell comprising and expressing the isolated or purified nucleic acid molecule of claim
 43. 45. A method of identifying an immunogenic portion of KLH, wherein the KLH is KLH-1 in the absence of KLH-2 or wherein the KLH is KLH-2 in the absence of KLH-1, which method comprises: (i) contacting antigen-presenting cells with KLH or a portion thereof, (ii) harvesting the antigen-presenting cells that have been contacted with KLH or a portion thereof, (iii) purifying complexes of major histocompatibility complex (MHC) and peptidic fragments derived from KLH from the harvested antigen-presenting cells, (iv) isolating the peptidic fragments from the purified complexes, and (v) analyzing the peptidic fragments to determine which fragments were bound to MHC, whereupon an immunogenic portion of KLH is identified. 46-47. (Cancelled).
 48. An immunogenic portion of KLH-1 identified in accordance with the method of claim
 45. 49. An immunogenic portion of KLH-2 identified in accordance with the method of claim
 45. 